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ABSTRACT 


This thesis presents a praetical method for ealeulating and representing entropy- 
based metries for a set of bibliographie records evolving over time, in support of Dr. 
Michael Saboe’s dissertation research which addressed the ability to measure software 
technology transfer. The implementation of the analysis methodology for determining the 
information-temperature of evolving datasets containing bibliographic records is 
described. The information-temperature metric is based on information entropy and is 
used to relate the maximum complexity of a system to the current complexity. 

The implementation of the analysis methodology required using data mining 
techniques to prepare the datasets. Additionally, since the information-temperature metric 
derived from Saboe’s work was a new emerging concept, the data analysis methodology 
had to be refined several times in order to obtain the desired results. An iterative software 
development paradigm was used to write the application in 3 iterations using Visual 
Basic. 

At the end of the implementation the data analysis process became systemized 
allowing the outlining of the steps to compute the temperature of datasets, and it is 
estimated that the learning curve of the analysis can be reduced by 50 percent through 
integration and packing of the analysis functions into a stand-alone application with an 
intuitive user interface. 
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I. INTRODUCTION 


A. BACKGROUND 

The information in this thesis includes an explanation of the problem the author 
worked on, the methods in which the author used to research and alleviate the problem, 
and the results achieved at the conclusion of the research. 

This thesis presents a practical method for calculating and representing entropy- 
based metrics for a set of bibliographic records evolving over time in support of Dr. 
Michael Saboe’s dissertation research which addressed the ability to measure software 
technology transfer. Saboe’s goal was to create mechanisms utilizing information theory, 
communication theory, chaos control theory, and learning curve principles to determine 
and predict the evolution of technology. The term entropy comes from the field of 
information theory and appears throughout this thesis. Entropy in this context is 
described below in the following excerpt from Saboe’s dissertation: 

Entropy as a concept can readily be seen as logical entropy (think 
of it as a measure of uncertainty, noise, non-signal, process inefficiencies, 
the percentage of work resulting in defects and requiring rework, etc) and 
physical or thermodynamic entropy (i.e. mixed-up-ed-ness, disorder, 
disorganization, etc), which is the quantity of energy not available to do 

work. Eogical entropy is Shannon's entropy as defined by Shannon 
on his treatise on communication theory (Shannon 1948). Shannon’s 
theory says that the entropy of an information source measures how well 
its behavior (e.g. the next symbol in a sequence it produced) can be 
predicted. 

The definition of entropy here is related to the definition of entropy 
in thermodynamics. What follows is a basic review of entropy in 
information theory after Shannon (1948). Eet X be a discrete random 
variable with alphabet 5"and a probability mass function p(x)=Vx{X=x}, 

X eS. p(x) and p(y) refer to two different random variables and are in fact 
two different probability mass functions px(x) and Py(y). 


The definition of information entropy is: 

Pi^) 



Sh is the entropy measured in bits, and the log is base 2. Log 2 will 
be assumed throughout unless otherwise noted. For example, the entropy 
of a fair eoin toss is 1 bit. The eonvention of 0^ log 0"^ ^0 is used, whieh 
eomes from eontinuity sinee x log x -^0^, as x -^0^. The base of the log is 
two for the natural units of information entropy as developed by Shannon 
(Shannon 1948). The entropy is a funetion of the distribution of X. It 
does not depend on the aetual values taken by the random variable X, but 
only on the probabilities. 


As Dr. Miehael Saboe reaehed the goal of his dissertation he developed a unit of 
measure dubbed “Saboe degrees”. A Saboe degree is an information-temperature 
measurement that was shown to be the eontrolling parameter of an evolving system. 
Saboe demonstrated that this information-temperature is derived from a system of 
equations whieh ineludes both an abstraet representation of a eonserved property 
(information in terms of primitive messages) and entropy (in information units of bits). 
This was demonstrated through the results of applying several ealeulations and 
transformations to datasets eontaining bibliographie reeords. Saboe deseribes the 
signifieanee of the temperature metrie in the following passage from his dissertation; 

Temperature is signifieant beeause it relates the maximum 
eomplexity of a system to the eurrent eomplexity. This is a proven metrie 
that ean be applied in many plaees to software engineering, e.g. software 
eomplexity. A direet relationship ean be easily made to Halstead’s metrie 
whieh is familiar to software engineers. This in turn has been related to the 
rate humans are eapable of making deeisions between two ehoiees, e.g. 
alphabet sets of sets of operator and operands, operators and edges, 
operators and flows. 


The information-temperature metrie is also used to deseribe teehnology transfer. 
The Pressure Law states that at eonstant volume, pressure is direetly proportional to 
temperature meaning as pressure inereases, temperature inereases. Saboe defined 
pressure as the number of <messages> proeessed per node, where the <messages> 
represents the average output in a time step per node. A node is eonsidered a publishing 
organization in the eontext of data presented in this thesis. Thus the temperature inereases 
as the amount of published information inereases. Theories exist stating that teehnology 
evolution ean be determined by the rate and the amount of published information over 
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time. These theories are out of the seope of this thesis, but a list of theories noted by Dr. 
Saboe is found in Appendix Q. With the relationship between temperature and pressure 
established a higher temperature means a teehnology is evolving quieker than one with a 
lower temperature. 

Methods from the field of data mining were used to obtain and prepare the 
bibliographie datasets, and Saboe’s methodology and temperature metrie were applied to 
the data by the author of this thesis to validate Saboe’s theory. Additionally, this thesis 
centers on the development of a software application to solve the following problem: 

A solution needs to exist that will preserve the calculation and data processing 
methods that produced the data analysis results shown in Dr. Michael Saboe’s doctoral 
dissertation. This solution must increase the efficiency of the current data analysis 
process by minimizing the amount of user effort required to complete the analysis tasks. 

B, RESEARCH QUESTIONS 

This thesis focuses on providing answers to the following questions: 

1. Can the data analysis process be systemized? 

2. What are the steps required to determine the temperature of a set of 
bibliographic records? 

3. Can the learning curve of the process be reduced? 

C, SCOPE, LIMITATIONS, AND ASSUMPTIONS 

The main outputs of this thesis are the requirements, design and architecture for 
the implementation of a software application named DataThermometer. 
DataThermometer is a data mining analysis tool that determines a bibliographic dataset’s 
temperature. The actual implementation of a ready-for-release version will not be created 
for this thesis due to time and man-power constraints. 

D, METHODOLOGY 

The author is an employee in the Next Generation Software Technology Area 
(NextGen), an organization within the US Army’s Tank Automotive and Armaments 
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Command (TACOM), who worked with Dr. Michael Saboe while he was the Associate 
Director of NextGen. The work involved reducing and analyzing data to be used in 
Saboe’s doctoral dissertation. With that, the methods the author used in attacking this 
project included consulting experts in the data mining field, interviewing co-workers, 
reviewing literature concerning data mining and software development, and attending 
Software Engineering courses at the Naval Postgraduate School. All the research and 
work performed has been done within the last 18 months. 

Two major challenges were encountered and overcome during the completion of 
this thesis. First, the information-temperature metric derived from Saboe’s work was a 
new emerging concept. Being a new concept, the data analysis methodology had to be 
refined several times in order to obtain the desired results. The author chose to implement 
an iterative software development paradigm to facilitate the development of the evolving 
data analysis methods. Second, the large volume of data to be analyzed required the use 
of data mining techniques to prepare it for processing. 

E, ORGANIZATION 

The following chapters describe major specific steps of the procedure used in 
solving the problem. First, Chapter II presents an overview of data mining and its 
applicability to this thesis. Secondly, Chapter III explains the software development 
process utilized during the development of DataThermometer. Finally, the analysis and 
conclusions are presented in Chapter IV. 


4 



II. DATA MINING 


The data had to be prepared before any type of calculation algorithms could be 
applied. The raw data had to be transformed into a structured representation. The 
transformation involved gathering the raw data from online databases containing a 
collection of scholarly references. After obtaining the raw data, the record structure of the 
dataset was analyzed. From determining the structure of the raw data, the author was able 
to create a regular expression import filter to process the records. Next, the data was fed 
through the fdter where the data was processed and organized into a structured 
representation. Once the data was in this structured form, several activities followed to 
finish preparing the data. The activities come from the field of data mining. The next 
sections give a brief overview of data mining and are followed by a discussion of the data 
preparation activities used for this thesis. 


A, WHAT IS DATA MINING 

“Data Mining” is a term referring to a recently-born discipline of study. Being 
such a young term there is no agreed upon definition. The definition presented below is 
at a high enough level to effectively describe data mining. 

Data mining is the process of discovering meaningful new correlations, patterns 
and trends by sifting through large amount of data stored in repositories, using pattern 
recognition technologies as well as statistical and mathematical techniques (Gartner 
Group). [1] 

The term data mining can also be used to describe a variety of data processing 
tools and strategies that increase the utility of data stored in databases. While there is 
currently no universally accepted definition, the term is broadly used whenever a process 
or person attempts to discover knowledge buried within a database. 
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B. 


DATA MINING EVOLUTION 


Data mining has evolved over the past 40 years from the process of analyzing 
data. Figure 1, below, shows the evolution began with in the 1960’s with data collection 
and continues today with data mining. 



Figure 1. Evolution of data mining. [From 2] 

1, Data Collection 

In the 1960s development programs focused on creating applications to present 
standardized reports of in formation contained in databases. The applications retrieved 
and manipulated the raw data producing simple summaries to meet specific decision¬ 
making needs. 

2, Data Access 

During 1980s, the desire for frequent and individualized information requests 
brought upon queries (informational requests). Queries accessed databases in order to 
obtain answers to a specific question. The queries of this era were built into the database 
system by the system developers. This produced a substantial time gap between 
implementing the query and obtaining the answer from it. Many database old-timers can 
recall the interminable information delays from the lack of ad-hoc queries (see Data 
Queries below). [3] 
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3, Data Queries 

Later, in the 1990s, the need for quiek answers to spur-of-the-moment questions 
(ad hoe queries) arose. [3] Users wanted information to be “just-in-time” to eorrelate with 
their produetion and deeision-making proeesses. That meant that not all of the users’ 
informational needs eould be preprogrammed into the system. At this stage, users began 
to write their own queries to extraet the information that they needed from the database. 

4, Data Mining 

In the last few years, needs ehanged onee again. Companies realized that they had 
aeeumulated volumes of information; and, as a result, the seareh began for tools and 
techniques that automatically identify and find relationships within these huge volumes 
of data. The techniques took the responsibility of finding data relationships out of the 
user’s hands and left the decisions up to the software. [4] The removal of this tedious task 
from the user, allows users to focus their attention on other areas such as interpreting 
what the found relationships mean. 

Data mining tools were first developed to help scientists find meaningful 
relationships or patterns from huge amounts of data that, if done in a traditional way, 
would require much time and many resources to find. 

Data mining, in many ways, is fundamentally the adaptation of machine learning 
techniques to business applications. Data mining is best described as the union of 
historical and recent developments in statistics, AI, and machine learning. These 
techniques are then used together to study data and find previously-hidden trends or 
patterns within. Data mining is finding increasing acceptance in science and business 
areas which need to analyze large amounts of data to discover trends which they could 
not otherwise find. 

Table 1, below, outlines questions of each era, discussed above, along with 
enabling technologies used to find answers to those questions. 
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Evolutionary Step 

Business Question 

Enabling 

Technologies 

Product 

Providers 

Characteristics 

Data Collection 
(1960s) 

"What was my total 
revenue in the last 
five years?" 

Computers, tapes, 
disks 

IBM, CDC 

Retrospective, static data 
delivery 

Data Access 
(1980s) 

"What were unit 
sales in New 
England last 
March?" 

Relational databases 
(RDBMS), Structured 
Query Language 
(SQL), ODBC 

Oracle, 

Sybase, 

Informix, 

IBM, 

Microsoft 

Retrospective, dynamic 
data delivery at record 
level 

Data Warehousing 
(feDecision Support 
(1990s) 

"What were unit 
sales in New 
England last 

March? Drill down 
to Boston." 

On-line analytic 
processing (OLAP), 
multidimensional 
databases, data 
warehouses 

Pilot, 

Comshare, 

Arbor, 

Cognos, 

Microstrategy 

Retrospective, dynamic 
data delivery at multiple 
levels 

Data Mining 
(Emerging Today) 

"What’s likely to 
happen to Boston 
unit sales next 
month? Why?" 

Advanced algorithms, 
multiprocessor 
computers, massive 
databases 

Pilot, 
Lockheed, 
IBM, SGI, 

numerous 

startups 

(nascent 

industry) 

Prospective, proactive 
information delivery 


Table 1. Evolution of data mining techniques. [From 5] 


C. DATA MINING ROOTS 

As mentioned in the previous section, data mining is a multidisciplinary field at 
the intersection of classical statistics, artificial intelligence (AI), and machine learning. 
The longest of these three lines is classical statistics. Without statistics, there would be no 
data mining, as statistics lays the foundation of most technologies on which data mining 
is built upon. Classical statistics embrace concepts such as regression analysis, standard 
distribution, standard deviation, standard variance, cluster analysis, and confidence 
intervals, all of which are used to study data and their relationships. [6] Those concepts 
are the very building blocks that helped bring about more advanced statistical analysis 
methods. Within the heart of today's data mining tools and techniques, classical statistical 
analysis plays a significant role. 

Data mining's second longest family line is artificial intelligence. This discipline, 
which is built upon heuristics as opposed to statistics, attempts to apply human thought- 
like processing to statistical problems. Because this approach requires vast computer 
processing power, it was not practical until the early 1980s, when computers began to 
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offer useful power at reasonable prices. AI found a few applications at the very high end 
scientific/government markets, but the required supercomputers of the era priced AI out 
of the reach of virtually everyone else. [7] The notable exceptions were certain AI 
concepts which were adopted by some high-end commercial products, such as query 
optimization modules for Relational Database Management Systems. [6] 

The third family line of data mining is machine learning, which is more accurately 
described as the union of statistics and AI. While AI was not a commercial success, its 
techniques were largely co-opted by machine learning. As Figure 2 shows below, the 
technological revolution increased computational ability while decreasing hardware and 
software costs. Machine learning was able to take advantage of the ever-improving 
price/performance ratios offered by computers of the 1980s and 1990s and it found more 
applications because the entry price was much lower than AI. Machine learning could be 
considered an evolution of AI, because it blends AI heuristics with advanced statistical 
analysis. Machine learning attempts to let computer programs learn about the data they 
study, allowing them to adapt their actions accordingly to previous results. Machine 
learning is used to analyze imprecise, incomplete, and complex information and deduct or 
find important relationships or patterns. 



•Processors many times powerful than yesterday’s mainframes 
•Ine^ensive disks can store hundreds of gi^bytes 
•Desktop very powerful for analysis tools. 

• Server software inexpensive, powerful, easy to maintain 
•Hardware and software prices sharply lower 


Figure 2. Impact of technological revolution. [From 8] 

9 


D, 


WHY DATA MINING EXISTS TODAY 


As mentioned earlier, so mueh data is colleeted today. Many databases are now 
quantified in terabytes of data. These databases contain valuable information on 
customers, suppliers, employees, research publications, and general economic conditions. 
Substantial effort is required to turn the vast amount of information into something 
understandable. Finding some sort of meaning may be the difference between success 
and failure for many businesses. Additionally, the amount of complexity of data in 
corporate databases continues to grow at unprecedented rates and without data mining 
techniques and tools the ability of the analysts and managers to act and make strategic 
decisions is severely limited. 


E. APPLICATIONS OF DATA MINING 

Today, there are many applications of data mining. Figure 3 below shows the 
associated algorithms of each data mining application: information discovery, automated 
prediction of trends and behaviors and forensic analysis. 



Figure 3. Data mining applications and associated algorithms. [From 9] 
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Data mining “discovery” is the proeess of identifying new patterns without 
speeifie direction as to the exaet determining faetors. This is a time eonsuming process 
requiring multiple iterations as the seareh for patterns is not focused and may uncover 
spurious relationships. However, it is a very powerful method to identify current 
relationships. This “discovery” mining is espeeially useful in deeoding human gene 
sequenees. 

The seeond form is the automated prediction of trends and behaviors. In 
“predietive modeling,” the data mining tool identifies patterns and uses that information 
to prediet the future. This data mining applieation is particularly popular in customer 
retention programs, by segmenting the eustomer base, determining the most profitable 
eategory of eustomer, and predieting the ability to retain that customer. Using such data, 
a targeted program ean be implemented to inerease the probability of retaining the most 
profitable eustomer. 

The third form of data mining, known as “forensie analysis,” entails applying a 
pattern to the data (whieh may have been determined through a previous data mining 
exereise) in order to identify data anomalies. This applieation of data mining is 
prevalently used today by quality assuranee and eredit card fraud systems. 


F. DATA MINING PROCESS 

Every set of produets must have a defined proeess behind it. To discover 
knowledge by finding trend and patterns, predicting behavior, or finding anomalies in a 
series of data, data mining follows a multi-step proeess. These steps are shown below as 
the Knowledge Diseovery from Databases (KDD) proeess in Figure 4. 



Selection 


Pre¬ 
processing 


Data 


Trans¬ 

formation 


Data 

Mining 


Interpretation/ 
Evaluation 


Target 

Preprocessed 

Transformed 

k. 

Patterns 

Data 

Data 

Data 


-: 



-1 ^ 


Knowledge 


Figure 4. Overview of the steps eonstituting the KDD proeess. [From 10] 
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1 . 


Obtain Data 


The first step in the data mining proeess is the aequisition of data. The 
repositories of required data are usually disparate systems that include legacy databases, 
non-legacy databases, third party data sources (i.e. supplier databases), and even real¬ 
time transactions not currently stored. This data is then passed along to a central 
information repository called a data warehouse. At the data warehouse, the data is 
processed in the following ways: 

1. The data is translated to a format that the data warehouse comprehends. 

2. The data is cleansed to correct blank or invalid fields through a set of 
business rules which, for instance, may correct a missing zip code by 
marking the attribute as “blank”. 

3. The data is transformed to a standardized format, which for instance might 
translate gender data from one source of “M” to the standardized format of 
“male”. 

Once the data is processed, it is entered into the data warehouse and other 
respective data repositories 

2, Focus Data 

The second step involves selecting the target data through data reduction. This 
includes finding useful features to represent the data, depending on the goal of the task, 
and using dimensionality reduction or transformation methods to reduce the effective 
number of variables under consideration or to find invariant representations for the data. 
[ 10 ] 

During this step, the data mining algorithm is determined (such as, 
summarization, classification, regression, and clustering), and the purpose of the model is 
derived. 


3, Apply Data Mining Activities 

With reducing the data and determining the algorithm to be used, the data mining 
tool will begin its work of clustering, classifying and scoring records. In clustering, the 
data mining tool identifies distinct clusters, or groupings, of similar data records. Using 
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these elusters, the data mining tool then classifies or categorizes each record into a cluster 
and assigns a score indicating the probability of the record fitting into and exhibiting the 
behavior of the identified cluster. 

4, Learn from Results 

The final step involves interpreting the results and using the knowledge acquired. 
The initiator of the process must translate the findings into meaningful action. In the case 
of a learning system, a hybrid system that combines artificial intelligence with data 
mining to “learn”, the system incorporates the findings into its data warehouse to be used 
for future mining and prediction activities. In the case of an automated process, such as 
an e-commerce site, the findings may be used to present a specific product offering or 
discount to a current site visitor and potential customer. Similarly, if a degradation 
pattern is identified in an automated production quality monitoring and prediction system, 
the system might auto-alert the responsible production manager via e-mail or a page. If, 
however, the initiator is a business analyst, it is his/her responsibility to translate the 
discovered insight into meaningful action. The resulting patterns themselves add no value 
to the business or organization. Only changes enacted as a result of identifying and 
interpreting these data patterns and relationships add such value. 


G. NOT A PANACEA 

As hype regarding the promise of data mining grows in the business community, 
it is important to manage the expectations surrounding its immediate results. Companies 
must educate their consumers that data mining deals with human behavior and human 
interpretation of its results. [11] Other limitations include high costs to mine information, 
limited usability of expensive software tools, and the lack of efficient algorithms to 
search gigantic databases. 

H. GATHERING & ANALYZING THE DATA 

The raw datasets processed in this thesis were obtained through several online 

databases (e.g., IEEE’s INSPEC, U.S. Department of Commerce’s National Technical 
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Information Service). A sample of approximately 48,000 records was collected, each 
record represented abstracts centered on the following key words: 

• Software Engineering 

• Abstract Data Types 

• Ada 

• Java 

• Rate Monotonic Analysis 

• Software Cost Models 

• Software Work Breakdown Structures 

• Software Technology Transfer 

The data in raw form, shown in Figure 5, was delimited text with field indicators. 
Appendix F contains a list and description of the data fields for the INSPEC database, 
which is where most of the data came from. All of the databases had similar formats for 
raw data making them compatible. The author analyzed the raw records and selected the 
fields necessary to carry out temperature-entropy calculations. The fields chosen were the 
following: 

• Accession Number 

• Affiliation 

• Author 

• Descriptors 

• Identifier 

• Fanguage 

• Number of References 

• Source 

• Publication Year 

• Title 
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LA: English 
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Figure 5. Raw record extracted from INSPEC database. 


By following the data mining process described earlier, the next step after 
selecting the data fields was to represent them along with their data in a structured form. 
The implementation of this step is described within Iteration One in Chapter III. 
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III. SOFTWARE & METHODOLOGY DEVELOPMENT 


A, SOFTWARE DEVELOPMENT PROCESS 

The data analysis process and software were created using a prototyping and 
iterative design process. 

Prototyping and iterative design enable us to create vastly improved products in 
less time than with the waterfall methodology, provided that the role of prototyping is 
well understood, and that it is managed properly (Wilson). Additionally, in this approach, 
development is organized into a series of short, fixed-length mini-projects called 
iterations; the outcomes of each is a tested, integrated, and executable system. [12] 

The advantages to this approach as described by Wilson are the following: 

• Early visualization of the product 

• Crisp definition of requirements 

• Early user testing 

• Enhanced communication within the development organization 

• Enhanced feedback to users. 

The stages of the iterative/prototyping process are shown in Eigure 6 and 
described below. The four stages have activities that map directly to the set of activities 
performed by a team of systems engineers when they set out to create a computer system 
solution. These activities remain fairly consistent from one project to another. [13] The 
activities are also discussed below under each iteration stage. 



Eigure 6. Iterative Development Cycle. [Erom 15] 
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1 . 


Plan 


The purpose of the planning stage is to determine the user’s needs and figure out 
how those needs will be addressed. Planning encompasses the activities of requirements 
gathering, analysis and design. 

a. Requirements Gathering 

Requirements gathering is the task of documenting the functions that the 
application should perform for the users in the users’ language and from the users’ 
perspective. 

Users view computer systems as black boxes meaning users care very 
little about the inner-workings of the computer system. Users want to be able to provide 
input to the system and obtain the output they expect. With this in mind requirements 
documentation that puts everything in the context of “going in” and “coming out” has 
more relevance to the users who study it. [13] 

b. Analysis 

Analysis encompasses the building of a logical solution that satisfies the 
requirements but does not necessarily take the physical constraints into account. 

c. Design 

Design takes the logical solution from the analysis and changes it to work 

effectively with the physical constraints (storage, database performance, caching, 
execution time, and so forth). The final output of design is a set of specifications that can 
direct the construction effort. 

2. Implement 

During implementation, a prototype is built to test the solutions developed during 
the planning stage. The activity, construction, is seen here. 

a. Construction 

Construction uses the design to produce working code, which involves 
making the lowest-level design decisions, writing code, compiling, debugging, and 
testing by increment. [13] 
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3. 


Measure 


Wilson points out that measuring involves gauging user interaetion through a 
series of questions. Sueh questions are the following: 

• How long does it take users to understand the solution? 

• How long does it take them to do their work? 

• What problems do they eneounter? 

These measurements must be both subjeetive and objeetive; time on task is 
important, but if a user takes more time but produees substantially higher quality work, 
then the weight of the improved quality needs to be eonsidered. 
a. Testing 

Testing is the aetivity of using the eonstrueted applieation to produee a 
eomplete working system by system testing, deteeting and reeording issues, fixing 
problems, and getting user aeeeptanee of the result. 

4, Learn 

During the learning phase it is deeided whieh parts of the prototype are working 
and what parts are not. 

There are many reasons a prototype does not aehieve its goals, the most 
signifieant reason being an ineorreet understanding of what the users want to aeeomplish 
(requirements). 

Iterative development proves to be the best software development proeess for this 
type of projeet due to the faet that all of the user’s needs were not known before the 
projeet started. Iterative development is based on an attitude of embraeing ehange and 
adaptation as unavoidable and indeed essential drivers. [12] 

Wilson explains that as development eycles through the phases, the developers are 
gaining a fuller understanding of the users and their needs, and eoming eloser to a 
finished produet. The development eyeles through these four phases until there is an 
agreed upon satisfaetion between the developers and the users obtained through the 
measure and learning phases. 
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The data analysis tool presented in this thesis went through four iterations before 
it eould be publiely released. Eaeh iteration is discussed in detail within the following 
sections. 

The requirements for each iteration were gathered from a series of interviews 
between the developer (this thesis’s author) and the primary researcher (Dr. Michael 
Saboe). These interviews are presented in Appendix A. 

It is also important to note that for the first three iterations the primary user of the 
developed applications was the developer himself. It is not until iteration four that 
DataThermometer is born and the user base switches. 

B, ITERATION ONE: ESTABLISHING THE PROJECT 

Iteration one marked the beginning of the software development portion of the 
project. The first iteration was where the first set of requirements was gathered and where 
the first prototype was developed. The requirements for this iteration came from the first 
interview found in Appendix A. 

1, Plan 

From the requirements interview, the goals of this iteration were to fulfill the 
requirements listed below: 

• The system shall parse user supplied data from INSPEC databases. 

• The system shall remove duplicate records from the data. 

• The system shall standardize the data fields. 

• The system shall compute a time-based entropy calculation based on the 
descriptors field and the publication year of the data. 

The goals of this iteration are also to satisfy the first two steps of the data mining 
process discussed in Chapter II (Obtain and Focus data). 

The use case diagram, Figure 7 below, shows the functionality of this iteration 
that must be available to the data analyst. 
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Two commercial off the shelf (COTS) applications facilitated in meeting the 
above requirements; The Technology Opportunities Analysis of Scientific Information 
System (Tech OASIS) and Microsoft Excel (MS Excel). 

Tech OASIS is a government-tailored software program that is commercially 
available under the trade name VantagePoint. One of the many uses of Tech OASIS is 
ability to process raw bibliographic data through the use of import filters. Once the data is 
imported, Tech OASIS can pre-process the data to remove duplicates and standardize it. 
Additionally, Tech OASIS can be extended by using the built in script editor for custom 
analysis. 

Eigure 8, below shows how a user would use Tech OASIS to process data 
obtained from bibliographic databases. 
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Figure 8. Tech OASIS data file creation. 

Microsoft Excel was used to generate the entropy charts. Tech OASIS interfaced 
with MS Excel by using the visual basic scripting feature. A script in Tech OASIS sends 
data to MS Excel and applies MS Excel functions on that data in the form of mathematic 
formulas. Once results are obtained the script runs MS Excel functions relating to chart 
generation. 

Below, Eigure 9, is a class diagram based on the analysis. The class diagram 
shows Tech OASIS has import fdters, export scripts, and data processing methods. The 
export script is used to bridge the gap between MS Excel and Tech OASIS. 
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Figure 9. Iteration one class diagram. 


The sequence diagram below (Figure 10) explains how a use would interface with 
this setup. 



Iteration One: Sequence Diagram 


Figure 10. Iteration one sequence diagram. 
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The following steps can be seen in the sequence diagram shown in Figure 10: 

1. The data analyst creates a new data file. 

2. The data analyst runs an import filter to parse the data. 

3. The data parser object updates the data file. 

4. The data analyst runs the remove duplicates command. 

5. The data fide is updated. 

6. The data analyst runs the clean data command. 

7. The data fide is updated. 

8. The data analyst executes the compute entropy script. 

9. The export script sends data to MS Excel. 

2. Implement 

The implementation consisted of creating an import filter to extract the data from 
its raw form and place it in a Tech OASIS data file. 
a. Importing the Data 

Importing the data required the use of an import filter to decipher the raw 
data and place it into a structured form. The author created a new database configuration 
file within Tech OASIS where the filter syntax was placed. Figure 11 shows the 
Database Configuration Editor with the finished import filter. After completing the filter, 
the data was ran through and organized into a structured representation in Tech OASIS. 
Several activities were then performed on the data to prepare it for the entropy analysis. 
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Figure 11. Database Configuration Editor showing the import filter. 

b. Preparing the Data 

Finishing touehes were then applied to the data to prepare it for export. In 
a diseussion in August with Robert Watts, he recommended that the dataset be “cleaned” 
before performing calculations on it. Tech OASIS cleans a list by attempting to identify 
list items that may be equivalent. For example, the terms "human-computer interaction" 
and "human computer interaction" will appear as separate items on a list (because of the 
hyphen between "human" and "computer" in the first instance). The list cleanup 
algorithms in Tech OASIS will catch this as well as plurals and simple misspellings. In 
addition. Tech OASIS can identify equivalents such as J. Smith, James Smith, and Smith, 
J. Figure 12 shows the fields that were cleaned. Note the different number of instances 


25 






































between the eleaned and non-eleaned fields. In addition to cleaning the fields, duplicate 
accession numbers were removed. An accession number is a unique number assigned to 
each record. 



Figure 12. Ada dataset summary that shows cleaned fields. 


Before duplicates were removed from the datasets there were -48,000 
records, afterwards there were 22,219. The number of records for each dataset is shown 
in Table 2. 


Dataset 

Records 

Descriptors 

Publication 

Years 

Software Engineering 

10,763 

2425 

1978-2000 

Abstract Data Types 

567 

364 

1974-2000 

Ada 

4195 

1460 

1979-2000 

Java 

5906 

2421 

1995-2000 

Rate Monotonic 
Analysis 

223 

342 

1986-2000 

Software Cost Models 

273 

394 

1972-2000 

Software Work 
Breakdown Structures 

36 

63 

1978-2000 

Software Technology 
Transfer 

256 

222 

1982-2000 

Totals 

22,219 

7691 



Table 2. Number of records for each dataset. 
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c. Export Script and Entropy Calculation 

After importing and preparing the data, an export script had to be 
constructed to calculate the entropy of the descriptors. The export script is where the user 
interacts with Tech OASIS by selecting the data and time fields used to calculate entropy. 
Figure 13 shows the script prompting the user to select the data and time fields, Figure 14 
displays a list of fields that the user can select from, and Figure 15 presents what happens 
when the user selects a time field that doesn’t contain groups (time intervals). 


|VB5crip( 


Select time field 1 

hat contains ini 

OK 

tervals as groups 

1 



Figure 13. Tech OASIS script prompts user to make a selection. 



Figure 14. User selection screen in Tech OASIS. 
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Figure 15. Error message when time field has no groups. 

The seript ereates a co-oeeurrenee matrix of the data field (rows) and the 
time field (eolumns) and exports it into Mierosoft Exeel. Then the export seript finished 
the entropy caleulations and generated the eharts. Eigure 16 shows a portion of the eo- 
oeeurrenee matrix ereated in Teeh OASIS. Eigure 17 presents the exported data in 
Mierosoft Excel. 
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Eigure 16. Co-occurrence matrix created in Tech OASIS. 


28 














































































































Time step 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 


# Records 

8 

108 

142 

146 

248 

170 

268 

288 

49 

290 

156 

137 

309 

258 

273 

# Records 

Descriptors (Cleaned) 

1979 

1980 

1981 

1982 

1983 

1984 

1985 

1986 

1987 

1988 

1989 

1990 

1991 

1992 

1993 

2697 

Ada 


48 

86 

89 

128 

79 

149 

170 

23 

198 

93 

107 

217 

176 

196 

548 

software-engineering 



14 

20 

56 

33 

33 

55 

13 

73 

33 

21 

45 

25 

27 

542 

real-time-systems 





10 

2 

9 

17 

3 

28 

20 

24 

59 

54 

49 

496 

object-oriented-programming 






1 



2 

11 

18 

30 

39 

33 

37 

395 

software-tools 







19 

33 

6 

42 

18 

24 

42 

30 

25 

389 

program-compilers 


10 

10 

14 

31 

18 

25 

28 

6 

29 

18 

11 

26 

21 

26 

280 

object-oriented-languages 













1 

10 

19 

276 

programming-environments 







48 

32 

11 

28 

19 

11 

26 

12 

13 

274 

software-reusability 









2 

9 

9 

23 

37 

23 

33 

255 

military-computing 


1 

4 

1 

13 

2 

13 

22 

8 

19 

9 

16 

34 

20 

30 

236 

programming 


4 

4 

6 

24 

16 

13 

28 

5 

25 

16 

1 

18 

18 

12 

233 
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Figure 17. Exported data in Mierosoft Exeel. 


The first iteration of the entropy calculation method was very simple in 
comparison to the methods of later iterations. The basic operations included calculating 
the cumulative entropy of the user-selected data field and creating a graph based on the 
results of the calculation. The cumulative entropy of the data field was calculated by 
applying the formula, given by Dr. Michael Saboe, to the exported co-occurrence matrix. 
The cumulative entropy formula and example are shown in Appendix B. Eigure 18 shows 
a sample of the data after the cumulative entropy formula has been applied. The resultant 
chart based on the cumulative entropy calculated is presented as Eigure 19. 
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Eigure 18. Data in Excel after applying entropy formula. 
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Cumulative Entropy vs. Year (Ada) 



3, Measure 

Measuring the user’s interactions was not important during this iteration because 
the user and the developer were the same person. The ability to perform this type of 
analysis in such a small time period provided the greatest benefit. 

4, Learn 

The initial set of requirements was fulfilled, but improvements and additional 
functionality were needed in the entropy analysis area. 


C. ITERATION TWO: EXPANDED ENTROPY ANALYSIS 

After iteration one, several requests for more functionality were received from 
Michael Saboe. As shown in the interviews from Appendix A, the first request asked for 
the addition of a power trend-line to the cumulative entropy chart. Second, a summary 
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sheet was requested. The desired features of the summary sheet were to display the 
following; 

• Time Intervals 

• Cumulative entropy in eaeh time interval (slice) 

• Percent difference of the cumulative entropy and the predicted value in a time 
interval. Predicted value comes from using the power trend-line’s equation 
(see Appendix B under “Predicted Entropy Calculation’’). 

• Predicted entropy values based on 5 years and 10 years of data 

The third request involved implementing new formulas supplied by Dr. Saboe on 
a new worksheet titled “Entropy lambda” to calculate the derivative of the cumulative 
entropy’s trend-line equation and lambda from using that calculated derivative. “Time 
Interval Derivative Calculation” and “Eambda Calculation” in Appendix B explain the 
formulas behind the new calculations. Along with the new calculations a graph was 
added to display cumulative entropy with lambda for each time interval and the 
difference of lambda subtracted from cumulative entropy. 

A fourth request was to calculate the Eyapunov exponent. The Eyapunov 
exponent used with lambda gives another method of computing entropy. This request 
added more to the “entropy lambda” sheet and a new chart. The calculation of the 
Eyapunov exponent came from another fomula supplied by Dr. Saboe. “Eyapunov 
Exponent Calculation” in Appendix B shows the formula and an example of its use. 


1, Plan 

The requirements that were added during this iteration were the following: 

• The system shall create an entropy summary sheet with the information 
specified above. 

• The system shall add a power trendline to the entropy chart. 

• The system shall format the entropy chart. 

• The system shall create and populate a worksheet titled “Entropy Eambda”. 

• The system shall calculate the Eyapunov exponent. 

The use cases in this iteration remain the same from iteration one. Eigure 20 
shows the revised class diagram. The class diagram reflects the addition of MS Excel 
macros written in Visual Basic for Applications (VBA). 
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Iteration Two: Class Diagram 



Figure 20. Iteration two class diagram 


The sequence diagram for iteration two is shown in Figure 21. The sequence 
diagram also reflects the addition of the Excel macros. 



Figure 21. Iteration two sequence Diagram 
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The sequence diagram shows the following steps; 

1. The Tech OASIS export script sends the data to MS Excel and then calls the 
Compute Entropy Excel macro. 

2. The Compute Entropy macro performs the entropy computations, generates 
the cumulative entropy summary sheet and graph. 

3. The Compute Entropy macro calls the Compute Eyapunov macro. 

4. The Compute Eyapunov macro creates a new workbook to calculate and 
store the entropy lambda sheets and graphs as well as the Eyapunov exponent 
calculation results. 


2. Implement 

During this iteration, the entropy calculation activity was moved to a MS Excel 
macro to increase system modularity and improve scalability. Another macro was 
developed to create the entropy lambda sheet, compute the Eyapunov exponent, and 
display the results. 

Eigure 22 shows the revised cumulative entropy chart with the power trend-line 
and formatting. Eigure 23 shows the generated summary sheet. 


Cumulative Entropy vs. Year 
(Ada) 



Eigure 22. Cumulative entropy chart final revision with trend-line. 
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Figure 23. Summary sheet generated by the maero. 


The additional requests added more souree code to the macro. The result of the 
additions of the formulas to calculate lambda and displaying the results is shown below. 
The “Entropy lambda” sheet is displayed in Figure 25. The graph of the results is 
presented in Figure 24. 
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Figure 24. Cumulative entropy with lambda & difference. 


The calculation of the Lyapunov exponent involved using the equation of a power 
trend-line from a new entropy chart. The new chart needed to show a map of Shk, Shk+i 
where Shk is the cumulative entropy of time step k. The resulting additions to the entropy 
lambda sheet are shown below in Figure 25. Figure 26 shows the chart, the map of Shk, 
Shk+ 1 , produced in this iteration. 
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Figure 25. “Entropy lambda” sheet with Lyapunov ealculation. 
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3. 


Measure 


As with iteration one, measuring user interaction with the system was 
unimportant. The importance of this iteration was the ability to perform new calculations 
on a standardized set of data which could then be applied to multiple datasets. 

4. Learn 

By fulfilling the requirements of this iteration focus switched to new data analysis 
methods and ways to implement greater granularity of the data presented in Interaction 
Three. 
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D, ITERATION THREE: FINALIZATION OF DATA ANALYSIS 

More analysis and calculations still needed to be added. Several algorithms and 
procedures were added during the third iteration. 

First, a higher level of granularity was needed. There were problems basing a 
publication’s date based on the “Publication Year” field. This limited the analysis to 
perform computations based on yearly time steps and produced inadequate results for 
shorter (time-wise) datasets. The need arose to perform the calculations using monthly 
increments. 

Secondly, there was great success with computing entropy based on each term 
contained in a record’s descriptor field. With this success it was determined that the 
publication’s descriptors could be combined. These combinations essentially make up the 
language of the dataset under scrutiny. 

Saboe points out, on page 67 of his dissertation (found in Appendix C), that a 
publication record contains a descriptor field. A descriptor field contains a set of terms 
that are representative of the topics covered in that publication. 

An example of the term combinations is shown below: 


A publication record contains terms A, B, C. The possible term combinations are 
the following: 


q-level = 1 

q-level = 2 

q-level = 3 

A 

AB 

ABC 

B 

BC 


C 




Q-level’s are used to describe the n-tuple (single, double, triple, and so on) 
combination of the terms for a publication record. From the above example the language 
of the dataset of one publication record is {A, B, C, AB, BC, ABC}. 


Third, the dataset was broken up by bands based on the production rate of an 
affiliation. This is described below under implement. 

Finally, Saboe described a new set of calculations for describing the pressure and 
temperature of the dataset. The theory behind these calculations is found in his 
dissertation. An excerpt from the dissertation is found in Appendix C and shows that the 
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temperature and pressure funetions are dependant on the eumulative entropy, number of 
terms, and number of authors. 


1, Plan 

The following are the set of requirements for iteration three; 

• The system must perform ealculations by month. 

• The system must establish the vocabulary of the dataset by obtaining the 
combination of terms 

• The system must break the dataset up into bands based on the production rate 
of affiliations. 

• The system must compute the entropy, temperature, and pressure of the 
dataset on both combined and uncombined terms. 

The use cases remained the same from iteration one except for two new use cases 
that came into existence to facilitate features of MS Access. Figure 27 shows the added 
use cases. 

Access allowed the data to be organized better than Tech OASIS through the use 
of relationships and provided different data views with its query capabilities. Access was 
especially needed to provide the time interval relationships to the accession number field 
which gave the capability to perform calculations in monthly instead of yearly intervals. 
Access also facilitated re-usability. The same database schema was used for each dataset. 
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The class diagram had to be expanded to account for the use of MS Access. 
Shown below in Figure 28, it is apparent that MS Excel and MS Access interface with 
each other through the use of VBA programs. 


Iteration Three: Class Diagram 



Figure 28. Iteration three class diagram. 


A typical sequence of events is shown in the two sequence diagrams below. 
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Figure 29. Iteration three sequence diagram one of two. 


Figure 29 shows the following sequence of events: 

1. The data is exported from Tech OASIS to a MS Excel workbook via export 
script. 

2. The export script runs the Assign Month & Year macro. 

3. The Assign Month & Year macro places the month and year the publication 
was entered into the INSPEC database next to the publication’s title. 

4. Once the Assign Month & Year macro finishes, the Combine Terms macro is 
called to run. 

5. The workbook is updated with the combined terms. 

6. The data analyst manually imports all the worksheets into a newly created 
MS Access database. 
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Figure 30. Iteration three sequence diagram two of two. 


Figure 30 shows the following sequence of events: 

1. The user manually imports the MS Excel worksheets into MS Access. 

2. The user runs the Export Query macro from within MS Access. 

3. The Export Query macro creates the co-occurance matrices through queries 
and sends the results to MS Excel. 

4. The data analyst starts the compute entropy macro. 

5. The Compute Entropy macro performs the entropy computations, generates 
the cumulative entropy summary sheet and graph, entropy lambda sheets and 
graphs, and temperature and pressure graphs. 

6. The data analyst executes the Compute Eyapunov macro. 

7. The Compute Eyapunov macro creates a new workbook and calculates the 
lyapunov exponent. 

8. The data analyst runs the Compute n-Tuple Entropy macro. 

9. The Compute n-Tuple Entropy macro creates a new workbook where the 
results are stored. 

As part of the database design an entity relation diagram (ERD) was created to 
show the relationships between the export data fields from Tech OASIS. Eigure 31 shows 
the ERD. 
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Dataset Database File 



Figure 31. Iteration three database entity relation diagram. 

2. Implement 

a. Calculations by Month 

The export script was re-written. Instead of exporting a co-occurance 
matrix created by Tech OASIS, the export script now sends a list of data from a user 
selected data field associated with accession numbers to maintain record divisions and 
relationships. 

During execution of the Tech OASIS export script, the user is prompted to 
select the data fields to export. The first prompt, shown in Figure 32, asks the user to 
select a unique identifier field. The next series of prompts ask the user to select the data 
fields, shown in Figure 33. A typical list of data fields is shown in Figure 34. The 
INSPEC accession number field is the one usually chosen as the unique identifier. The 
other important fields for this research included the Title, Affiliation, Authors, and 
Descriptors fields. 
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VBScript 


Select unique identifier field 


OK 


Figure 32. Prompt for unique identifier. 


VBScript 

ST 

Select data field 1 | 

1 1 


1 
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VBScript 

■-- ^ 

3 

Select data field 4 

1 1 


1 
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VBScript 

■- ^ 

3 

Select data field 3 | 

1 1 


1 

J 



Figure 33. Prompts to select data fields to export. 



Figure 34. List of fields to select. 
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Once the fields are selected, the export script creates a eo-oecurrenee 
matrix in Teeh OASIS of the data field values on the y-axis and the unique identifier field 
on the x-axis. 
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Figure 35. Export seript eo-oceurrence matrix. 


The export seript traverses this co-oeeurance matrix (shown in Figure 35) 
in a 2-d path. A column is selected and eaeh row is examined in that column. If the value 
of that column and row combination is not empty then the value in the eolumn header is 
sent along with the value in the row header to MS Excel. Figure 36 shows the resulting 
values in MS Excel. The final souree code for the export seript is found in Appendix K. 


^Book1 
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B C D ' E — 1 
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3 

2076615 
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7 
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The transfer of university software for ind 
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Imperial Software Technology Limited ^ 
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Figure 36. Data exported for the Title field into MS Exeel. 
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After the exportation completed, the month and year needed to be assigned 
to the publications. This was accomplished by another VBA macro. The macro traversed 
the worksheet containing the title of the publications while examining a list of accession 
numbers to see what month and year to place next to the title. See Appendix L for the 
list of accession numbers and the source code of the macro to assign the dates. Figure 37 
shows the results of the title worksheet after macro completed. 


^Book1 

Q'n'x 

ABC 


1 llNSPEC>!TitIe [PubDate 

PubYear ^ 

2 2046766 Technology transf 6/1/1983 

1983 

3 2076615 Early experiences 8/1/1983 

1983 

4 2078644 A comparison of c 8/1/1983 

1983 

5 2106610 Technology transf 10/1/1983 

1983 

6 2143179 Software engineer 12/1/1983 

1983 

7 2159589 The transfer of uni 1/1/1984 

1984 

8 2186146 Imperial Software 2/1/1984 

1984 

m" * ►”ViX AN Xmle/ Sheetl / ] i'j 

■~;ir 


Figure 37. Exported data after the date have been assigned to the Title field. 


b. Term Combination 

After applying the date to the records the descriptors were ready to be 
combined to generate the language of the dataset. The descriptors needed to be combined 
in a way that the powerset of the descriptors field is represented for every publication. 
Another macro named the “Term Combination” macro was written to accomplish this. 
The source code for the macro is found in Appendix M. 


The figures below show the results of the term combination macro. 
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Figure 38. Single descriptors. 
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Figure 38 shows the single descriptors of the publication indexed with the 
accession number 2186189 in bold. The worksheet of single descriptors is used as input 
to the term combination macro. Figure 39 shows the terms of the same records combined 
to create double terms, Figure 40 shows those same terms combined to create the triple 
terms, and Figure 41 shows them combined to create the quadruple terms. Thus the 
powerset of the descriptor field for that one record is complete. 
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Figure 39. Terms combined to make double terms. 
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Figure 40. Terms combined to make triple terms. 
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Figure 41. Terms combined to make quadruple terms. 
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c. Import into MS Access 

Next all the sheets created from the Tech OASIS exportation script and the 
Term Combination macro need to be imported into Microsoft Access. 

The relationships outlined in the above ERD make the database useful. 
The relationships are exploited through the use of queries. 

The process takes many tedious steps to complete for each dataset Which 
will have to be automated in a future iteration. The process of creating and population the 
database is as follows; 

1. Create a new database 

2. Select the import command under the file menu and under “Get External 
Data” see Eigure 42 below. 

3. Select the filename 


Q Microsoft Access 



RIe [ Edit yew Insert Tools Window 

' ti®ip 


Ql New,.. Ctrl+N 

^en... Ctrl+0 

L - ibB 


ll Get External Data ^ | 

o 

Q. 


1 Close 

Link Tables.., 

P 





in Design view 


{by using wizard 
jby entering data 

! 

I 

Database Properties ' 

itest.mdb 

2 \My Documents\.. .\WBS.nndb 

3 \..,\abstract data types.mdb 

4\MyDocuments\...\SW_Eng.mdb | 



Eigure 42. MS Access import command. 


4. Then select the worksheet to be imported. Shown in Eigure 43. 

5. Set the column heading and primary keys. As shown on the ERD in Eigure 31, 
the only primary key needed is for the “Title” table. 
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^Import Spreadsheet Wizard 


Your spreadsheet file contains more than one worksheet or range. Which worksheet or range 
would you like? 


f* Show W^orksheets 
^ Show Named Ranges 



Source Publication Year (T) 

Authors 

Affiliation 

AN 

n-Terms-01 


0 


Sample data for worksheet 'Title'. 


1 

INSPEC Accession Number 

Title 

2 

2046766 

Technology transfer of a sof 

3 

2076615 

Sarly experiences regarding 

4 

2078644 

ft. comparison of design strat 

5 

2106610 

Technology transfer: the key 

_6_ 

2143179 

Software engineering: strate 





Figure 43. Worksheet selection. 


6. The queries need to be created after the importation of the worksheets into 
MS Access. The MS Access query designs are show below in Figures 44-48. 



Figure 44. Co-occurrence matrix query design (descriptor instances over time). 
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Figure 45. Co-occurrence matrix query (authors instances per affiliation over time) 



Figure 46. Co-occurrence matrix query design (affiliation instances per affiliation over time) 



Figure 47. Co-occurrence matrix query design (descriptor instances over time - descriptor 

and affiliation pair). 
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Figure 48. Co-occurrence matrix query design (n-term instances over time) 


The last query shown must be replicated for all the q-levels or term 
combination tables. 

The queries’ results then need to be exported to allow the analysis macros 
to run on them. This is done by selected the export command under the file menu while 
viewing the query. Another VBA macro was created to speed up the process of exporting 
the queries relating to term combination. The source code to the macro is shown below 
in Figure 49. 


Sub OutputDataToExcelQ 

For i = 1 To 8 T to the number of q_levels, this must be changed for each dataset. 

If i < 10 Then 
'terms: 

DoCmd.TransferSpreadsheet acExport, acSpreadsheetTypeExcel9, "0" & i & " term month", 
"c:\dataout\WBS_qlevels_term_month.xls" 

Else 

'terms 

DoCmd.TransferSpreadsheet acExport, acSpreadsheetTypeExcel9, "" & i & " term month", 
"c:\dataout\WBS_qlevels_term_month.xls" 

End If 
Next i 

End Sub 


Figure 49. Code to export queries to MS Excel 
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d. Affiliation Distribution Analysis Macro 

With the co-occurrence matrices in MS Excel, powerful VBA macros 
could be ran on the data. The VBA macros reside in MS Excel workbooks along with 
skeletons of the charts. The macros copy the charts into the workbook after the analysis 
results are stored in the workbook. 

The first macro is called “Affiliation Distribution”. This distributes the 
records based on the production of each affiliation. This macro requires four co¬ 
occurrence matrices exported from MS Access. The four needed are the following: 

• Matrix of descriptor (term) instances per descriptor over time. See query in 
Eigure 44. 

• Matrix of author instances per affiliation over time. See query in Eigure 45. 

• Matrix of affiliations instances per affiliation over time. See query in Eigure 
46. 

• Matrix of descriptor instances per descriptor and affiliation pair over time. 
See query in Eigure 47. 

The records are distributed into four bands based the number of records 
produced by an affiliation. Eigure 50 shows the created distribution sheet. 


Statistics 



Mean 


1.892197 

Band Range 

Stdev 


2.522752 

Band D Band C Band B Band A 

Sum 
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65 
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9.46 
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25 
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>25 
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Eigure 50. Distribution of affiliations into bands for the Ada dataset. 
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The bands are determined by using the mean (g) and the standard 
deviation (a) of the number of publications each affiliation produced. The affiliations in 
each band are based on the number of records produced n and are determined as follows; 

Band D; 0 > n < p+la. 

Band C: p+la > n < p+2a. 

Band B; p+2a > n < p+3a. 

Band A p+3a > n 

The discussion of the purpose of dividing the dataset into bands is 
explained in Saboe’s dissertation on pages 126-132 which have been included in 
Appendix C. 


After the affiliations have been distributed, entropy, temperature, and 
pressure can be calculated for each band and the world (no affiliation separation). The 
results from the bands provided gave poor results so the focus of this research centered 
on the dataset as a whole. After the calculation algorithms have finished, summary sheets 
and charts are generated such as the one show in Figure 51. 
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Figure 51. Summary sheet showing descriptor term instances for the Ada dataset. 


Temperature of a time step is calculated from the change in cumulative 
entropy and change in the number of terms in the dataset. Figure 52 shows the results of 
the temperature calculation highlighted on a summary sheet. Saboe describes this 
calculation on pages 116-121 of his dissertation which have been included in Appendix 

C. 
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Figure 52. Summary sheet showing temperature calculation for the Ada dataset. 


Saboe describes pressure as the calculation of messages processed per 
node. Figure 53 shows the results of the pressure calculation highlighted on a summary 
sheet. 
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Figure 53. Summary sheet showing pressure calculation for the Ada dataset. 
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Figure 54 shows the resultant graph from the two calculations of pressure 
and temperature that was used in Dr. Saboe’s dissertation on page 120. 

Pressure and Temperature 
vs timestep 

Temperature and Pressure vs Timestep 



Figure 54. Pressure and Temperature Graph 


Several other analyses are produced from the “Affiliation Distribution” 
macro, but are not significant enough to report here. See Appendix N for the final source 
code and Appendix H for all the output charts. 


e. Entropy Lambda Analysis Macro 

The second macro is called “Entropy Lambda”. This macro uses results 
from the first macro, specifically the “Affiliation Summary” sheet shown in Figures 52 
and 53. During this iteration the macro is refined to provide more precise results. Instead 
of guessing the beta percentage (see Saboe 161-162, 201-202 in Appendix C) the beta 
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Beta per Author Entropy (SH) 


percentage is determined. Source code is found in Appendix O. The results of this macro 
are shown below in Figures 55 and 56. 


Entropy and Beta_k 



k Timestep (years) 


Figure 55. Entropy over time with beta fit. 
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Figure 56. Author instances over time. 
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Q Level Analysis Macro 


/ 

The final analysis macro is called “Q_Level_Analysis”. This macro 
analyzes the term combinations. The inputs to this macro are the co-occurrence matrices 
of the q-levels. The matrices come from the query design shown in Figure 48. The query 
must be ran on every term combination table. 

From the results it can be seen the “Q Level Analysis” macro determines 

the following. 


• Figures 57 & 58 - Number of term instances in all the q-levels over time. 

• Figure 59 - Cumulative entropy of the terms in all the q-levels over time. 

• Figure 60 - Temperature of the dataset base on the term combinations. 


q-level Distribution 
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Figure 57. Term combination distribution. 


57 















Entropy Term Instances 


number of term instances of all q over time 



Figure 58. Number of eombined-term instanees. 
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Figure 59. Entropy of eombined-terms. 
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Figure 60. Temperature of the dataset. 

All the output charts are shown in Appendix J. The code for the 
“Q Level Analysis” macro is presented in Appendix P. 

3, Measure 

With the addition of new analysis techniques and the refinement of old ones, more 
accurate and precise analysis was realized during this iteration. Dr. Michael Saboe was 
able to complete his dissertation and obtain his PhD. 

Measured user interaction for this iteration found a major problem. The main user 
is the system developer and he is the only one who knows how to use these analysis 
functions due to the substantial amount of data preparation required to allow the 
functions to execute correctly. 
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4. 


Learn 


Most of the data analysis techniques finalized in this iteration are ready for 
production. One analysis area that needs to be looked at is with the banding. The banding 
analysis charts from the “Affiliation Distribution” macro was not used and need to be 
revised or omitted from future iterations. 

The major lesson learned from this iteration is that in order for the above analysis 
techniques to be applicable to future research the amount of user-interaction needs to be 
greatly reduced. 


E, ITERATION FOUR: PACKAGING & INTEGRATION 

With Iteration three marking the final development stage of the data analysis 
process, the next logical step is to develop an application to manage the process. Iteration 
four looks at taking the previously developed analysis techniques and packing them into 
one application. This new application has been dubbed DataThermometer. 
DataThermometer consists of a graphical user interface (GUI) that minimizes the amount 
of user interaction by providing one interface to the data analysis functions and the 
results. The Major features of DataThermometer are listed in Table 3. 


Feature # 

Feature 

1. 

Imports data from Microsoft Excel spreadsheets. 

2. 

Stores the data in a relational database. 

3. 

All functionality is performed by the user through DataThermometer’s 
graphical user interface (GUI). 

4. 

User-selected data processing methods are applied to the data. 

5. 

The results of the data analysis are in the form of Microsoft Excel 
spreadsheets. 

6. 

DataThermometer keeps track of the data processing methods already ran 
on a dataset. 


Table 3. Major Features of DataThermometer. 


Implementation for iteration four was not finished due to the death of Dr. Saboe. 
Even though implementation never occurred, extensive planning was still accomplished. 
The next section presents the class, use case, and sequence diagrams. The Software 
Requirements Specification (SRS) document, found in Appendix D, shows a detailed 


60 




overview of the project along with and the use cases descriptions presented in Appendix 
E. 


Iteration Four: Class Diagram 



Figure 61. Iteration four class diagram. 


The above class diagram shows that DataThermometer links many MS Excel 
workbooks to many MS Access databases. The workbooks hold the data to be imported, 
and the results of the analysis functions. The databases contain the imported data and 
queries to re-organize and manipulate the data. 
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The above use ease diagrams show the funetions DataThermometer 
provides to its users. Eaeh use ease is deseribed in detail in Appendix E. 
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Figure 64. Iteration four sequence diagram one of two. 


Figure 64 shows the following sequence of events: 

1. The data is exported from Tech OASIS to a MS Excel workbook via export 
script. 

2. The user selects “Create Data File” command in DataThermometer. 

3. DataThermometer creates a new data file instance. 

4. The data file object requests the creation of a new database. 

5. The database is created. 

6. The user selects the “Import Data from Excel” command. 

7. The DataThermometer GUI instantiates the DataThermometer Data Importer 
object. 

8. The Data Importer object executes MS Access functions to import MS Excel 
spreadsheets. 

9. MS Access requests and receives the data from the MS Excel workbook. 

10. The user selects the “Exit DataThermometer” command. 

11. DataThermometer closes the opened data files and terminates. 
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Iteration Four; Sequence Diagram (2) 


Figure 65. Iteration four sequence diagram two of two. 


Figure 65 shows the following sequence of events: 

1. The user selects the “Open Data File” command. 

2. DataThermometer loads the datafile. 

3. The user selects the “Open Help Documentation” command. 

4. DataThermometer opens the help documentation in another window. 

5. The user selects the “Perform Analysis Function” command and then chooses 
the “perform the entropy calculation” option. 

6. DataThermometer instantiates the Analysis object. 

7. The Analysis object performs operations on the MS Access database. 

8. The MS Access database exports queries to MS Excel. 

9. The Analysis object finalizes the calculations based on the exported queries. 

10. The Analysis object updates the data file by noting the completion of the 
analysis function and setting the pointer to the results. 

11. The user selects the “View Analysis Results” command and chooses to view 
the entropy calculation results. 

12. DataThermometer instantiates the Analysis object. 

13. The Analysis object sends a request to the data file for the location and name 
of the file that contains the results. 

14. The analysis object opens the MS Excel workbook with the results. 

15. The user selects the “Close Data Eile” command. 

16. DataThermometer unloads the data file. 
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IV. RESULTS, CONCLUSIONS, AND FUTURE WORK 


In this thesis three questions were investigated. The questions were presented in 
Chapter I and are the following: 

1. Can the data analysis process be systemized? 

2. What are the steps required to determine the temperature of a set of 
bibliographic records? 

3. Can the learning curve of the process be reduced? 

The answers to each of them lie in Chapter III and are explicitly stated in the 
following paragraphs. 

Systemization of data analysis process proved to be successful. Based on the 
research and software development performed for this thesis, the first question was 
answered. By examining the four software development iterations presented in Chapter 
III, it can be seen how the process evolved. Figure 66 below shows the process flow as it 
existed during the final process development iteration. Notice how the chart parallels the 
KDD process chart described in Chapter II. 
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Figure 66. Iteration three overall process flow. 

The steps required to determine the temperature of a set of bibliographic records 
are stated in Chapter III. There are two routes that can be taken when calculating the 
temperature. The calculation can be applied to a dataset’s combined-descriptor term 
instances or just to a dataset’s single-descriptor term instances. The exact steps involved 
to perform the temperature calculation on the combined terms are shown in Table 4, 
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where the steps are weighted based on ease of use beeause some steps require more effort 
than others. For example, setting up the queries requires substantially more effort than 
elieking a button to eomplete a task on a GUI. The seale ranges from 1 to 5 where 1 takes 
the least amount of effort and 5 takes the most. 


Task 

Effort 

1. Obtain raw data from bibliographie databases. 

3 

2. Open Tech OASIS. 

1 

3. Create an import filter if one does not exist. 

3 

4. Import the raw data into Tech Oasis through the use of an import 
filter. 

2 

5. Remove the duplicate records in the dataset. 

2 

6. Standardize (clean) each of the data fields. 

2 

7. Send the data in the data fields to MS Excel along with the 
unique identifier for each record via Tech OASIS export script. 

2 

8. Apply the month and year information to each record based on 
the accession number by using the assign accession number 
macro. 

2 

9. Combine the terms together using the Term Combination macro. 

2 

10. Open MS Access. 

1 

11. Create a new database. 

1 

12. Import each term combination MS Excel worksheet into a 
separate MS Access database table. 

5 

13. Import the other MS Excel worksheets into MS Access (title, 
affiliation, authors, single descriptors). 

3 

14. Set up the MS Access queries as described in Iteration Three. 

4 

15. Export the results of each term combination query (co¬ 
occurrence matrix) to a MS Excel workbook. 

2 

16. Export the results of the other four queries - co-occurrence 
matrices of single-descriptors, authors, affiliations, and single¬ 
descriptor terms by affiliation to a separate MS Excel workbook. 

3 

17. Run the “Affiliation Distribution” macro on the workbook 

2 

18. Copy the “Affiliation Summary” sheet produced by the macro 
execution from the previous step to the workbook with the term 
combinations (see step 14). 

2 

19. Run the “Q Eevel Analysis” macro on the workbook with the 
term combination sheets and “Affiliation Summary” sheet. 

2 

Total Effort Required 

44 


Table 4. Steps and effort required to eompute the temperature of the dataset with eombined 

terms. 
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The last question also produees a favorable answer. The learning eurve of the 
proeess ean be redueed. Iteration four eoneerns paekaging and integration, although not 
implemented, the iteration was well planned. The planning shows that adding one 
interfaee for all the importation and data analysis funetions greatly reduee the number of 
tasks the user has to perform. To prove this, the steps listed below perform the same 
temperature ealeulation on the eombined terms that is deseribed above. 


Task 

Estimated 

Effort 

1. Obtain raw data from bibliographie databases. 

3 

2. Open Tech OASIS. 

1 

3. Create an import filter if one does not exist. 

3 

4. Import the raw data into Tech Oasis through the use of an 
import filter. 

2 

5. Remove the duplicate records in the dataset. 

2 

6. Standardize (clean) each of the data fields. 

2 

7. Send the data in the data fields to MS Excel along with the 
unique identifier for each record via Tech OASIS export 
script. 

2 

8. Open DataThermometer. 

1 

9. Create a new DataThermometer datafile. 

1 

10. Use the “Import Data from Excel” command to import data 
into DataThermometer. 

2 

11. Select the “Perform Data Analysis” command. 

1 

12. Select “Temperature Calculation” 

1 

13. Select “View Data Analysis Results” 

1 

14. Select “Temperature Calculation” 

1 

Total Effort Required 

23 


Table 5. Steps and effort required to eompute the temperature of the dataset with eombined 

terms after integration. 

As shown in the above tables, the total effort required goes from 44 to 23 with the 
use of DataThermometer’s GUI to manage the handling of the data, ealeulation 
algorithms, and the analysis results. 

An implementation methodology and software tool for an entropy based 
engineering model for an evolving system has been proven to work. The implementation 
methodology was applied to eight different sets of data and behaved as expeeted. 
Furthering this proof is the faet that the results obtained from the developed ealeulation 
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algorithms provided Dr. Michael Saboe an experimental reference to his doctoral 
dissertation. The resultant charts can be seen throughout his dissertation. 

Future work involves the completion of the implementation of Iteration Four. 
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APPENDIX A - REQUIREMENTS ELICITATION 


The requirements of the system were obtained through a series of one on one 
eommunieations with the primary researeher, Miehael Saboe. The implementation 
followed an iterative proeess. Eaeh iteration produeed a prototype. Mike supplied the 
ealeulations, I implemented them so they eould be applied to all of the datasets in his 
researeh, and finally I sent the results to him for review. Then, either revisions were made 
to the implementation to eorreet any errors or new funetions were implemented to 
perform further data manipulation. 

The implementation proeess eontinued for several months. From the proeess, 
several funetions in the form of Mierosoft Exeel maeros and Mierosoft Aeeess modules 
written in Visual Basie for Applieations (VBA) evolved. It was then determined that 
these maeros need to be plaeed inside a paekage with a GET so they ean be used to 
proeess future datasets and preserve the data proeessing methods. 

The below interviews deseribe how I obtained the requirements for the maeros: 

Note: The interviews are paraphrased. 

Interview One: Establishing the problem, the beginning. 

I asked Miehael Saboe what the primary purpose of the data analysis was. Mike 
told me the purpose was to provide an experimental referenee to his dissertation on 
teehnology transfer. I asked several questions to gain some more insight into the analysis 
funetions of the desired applieation. The following are some of the questions and 
answers: 


Matt: How will this application provide an experimental reference to your dissertation? 

Mike: The application will perform calculations on eight sets of data and generate results 
in the form of spreadsheets and charts. 

Matt: Where does the data come from? 

Mike: The data comes from online databases, specifically INSPEC. 

Matt: You’ll be supplying the data, right? What is in the data? 

Mike: Yes, I’ll give you the data. The data contains publication information on specific 
topics that I searched for. For example. I’ve searched for publication records using 
the following keywords: Ada, Java, Software Engineering, etc. 

Matt: What publication information is in the data? 

Mike: The key items are: title, descriptors, authors, affiliations, and accession number; a 
unique identifier INSPEC uses, which you could use to build relationships 
between the data fields. 

Matt: What types of calculations are to be performed? 

Mike: For starters, I need the entropy of each dataset’s descriptors over time. 

Matt: What do you mean, time? 

Mike: Yearly intervals. 

Matt: What’s a descriptor? 

Mike: A data field that gives the keywords used to index a paper. 

Matt: Will you be supplying the equations for the calculations? 

Mike: Yes, but implementation is up to you, as long as I have the results in a Microsoft 
Excel spreadsheet. 


71 



Matt: So you want me to parse the data and store the information somewhere? 

Mike: Yes, I recommend you talk to Bob Watts in the NAC [National Automotive 
Center], he has a software tool called Tech OASIS that allows importation and 
manipulation of publication data from INSPEC. 

The following are the key points from this interview: 

• Raw data of publication records will be supplied. 

• Tech OASIS can be used to import and clean the data. 

• The equations to be implemented will be supplied to the developer. 


Interview Two: The use of Tech OASIS 


In this interview, I spoke with Bob Watts from the NAC concerning the use of 
Tech OASIS to import the data. 

Matt: What does Tech OASIS allow me to do? 

Bob: Tech OASIS allows you to import your data by using an import fdter to parse the 
data fields. After that you can remove duplicate records and clean the data up. 

Matt: How does an import fdter work? 

Bob: You setup the import filter to recognize data fields and have it extract the data in 
that field. 

Matt: What do you mean, clean the data? 

Bob: Tech OASIS can identify two items that are different but equal in a datafield. 

Matt: Isn’t that an oxymoron? 

Bob: Not in this case, for example human-machine interface and human machine 
interface are listed as two different entries but are really the same thing. 

Matt: So you recommend cleaning the dataset? 

Bob: Yes. 

Matt: How do I get the data to Microsoft Excel for further manipulation? 

Bob: You can do this through Visual Basic automation scripts. 

The following are the key points from this interview: 

• Create an import filter to parse the data 

• Remove duplicate data using Tech OASIS 

• Clean the data using Tech OASIS 

• Create an automation script to export the data to MS Excel. 

Interview Three: Expanded Entropy Analysis 

This interview took plaee after Teeh OASIS was used to filter the raw data 
and perform the initial entropy analysis. 

Mike: The initial analysis looks good. I have a couple of suggestions and requests. 

Matt: What are they? 

Mike: First, the cumulative entropy chart needs to show a power trendline. 

Mike: And a summery sheet needs to be created. 

Matt: What goes on the summary sheet? 

Mike: Time intervals, cumulative entropy for each interval, and some calculations based 
on the trend line. 

Mike: I also have a couple of new equations. First we’ll need a new sheet to store and 
calculate a Lambda value and then the Lyapunov exponent needs to be calculated. 
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Matt: OK, send me some documentation describing the calculations and why they are 
important. 

The following are the key points from this interview: 

• Add a power trendline to the entropy chart. 

• Generate a summary sheet 

• Calculate lambda and the Lyapunov exponent 

Interview Four: More Calculations 


This next set of questions and answers describes additional calculations 
that need to be performed; 

Matt: I’ve done the entropy calculation of descriptors over years, now what needs to be 
done? 

Mike: Now that we’ve done this by years, we need to do it by month. 

Matt: What else? 

Mike: We need to know the number of authors, records, term instances, and affiliations 
per month. 

Matt: You also mentioned something about combination of terms? 

Mike: Yes, we need to know the combinations of doubles, triples, quadruples, ect. of all 
the terms in order to establish the vocabulary of the dataset. After you obtain the 
vocabulary you need to compute the entropy of the vocabulary, the temperature 
and the pressure. 

Matt: You’ll give me the equations to do this right? 

Mike: Yes, first determine the vocabulary and we’ll move further from there. 

The following are the key points from this interview: 

• Perform calculations by month. 

• Obtain the number of authors, records, terms instances, and affiliations per month. 

• Establish the vocabulary of the dataset by getting the combination of terms 

• Compute the entropy, temperature, and pressure of the dataset. 


Interview Five: Packaging the data manipulation functions inside a GUI, 

The Creation of DataThermometer: 

The set of questions and answers below describe the problem of 
preserving the functions created. 

Matt: After processing all the data you need for your dissertation, what is the next step. 
Mike: We need to use the methods you’ve implemented to process future datasets. 

Matt: Well, at the current time. I’m the only one that knows how to do this. 

Mike: This is true, that’s why you should put a front-end on these functions. 

Matt: Are you suggesting a GUI? 

Mike: Yes, a graphical user interface would work, provided there were detailed 
instructions included with it. 

Matt: Right, using Visual Basic would speed this along since most of the code is 
compatible with this language. 
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The following are the key points from this interview: 

• The methods of processing the data must be able to be used on other datasets. 

• Create a GUI using Visual Basic to preserve the data manipulation functions. 

• Create documentation outlining the processes used to complete the data analysis. 

From this last interview, the following problem statement beeame clear: 

A solution needs to exist that will preserve the calculation and data processing 
methods derived from Michael Saboe’s doctoral dissertation. This solution must have a 
user interface which will let the user perform, at minimum, the following tasks: 

Select the data processing methods 
Generate and store results 

From this problem the project proposal became apparent. 

Project Proposal 

Project Title: DataThermometer: an application to measure the stability and 
complexity of a technology and the transfer channel. 

Background: The application is based on the technology transition model 
developed by Michael Saboe for his doctoral dissertation. 

Main customer: Researchers 

Goal: To create an application, with a user interface, that will take in a dataset 
and output intensive and extensive quantities. 

Problem statement: 


Elements 

Description 

The problem of 

duplicating the research results of this doctoral 
dissertation 

affects 

researchers 

the results of 


which 

is wasted effort implementing methods that have been 
already developed. 

Benefits of 

a new application to address the problem include: 

• Ability to extend the research into other areas. 

• Saves research time, and 

• Research provides a useful tool instead of just a paper. 
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APPENDIX B - SAMPLE CALCULATIONS 


Entropy definition 

Michael Saboe describes entropy and how it relates to information theory in his 
doetoral dissertation (also presenting in Chapter I): 

Entropy as a concept can readily be seen as logical entropy (think of it as a 
measure of uncertainty, noise, non-signal, process inefficiencies, the percentage 
of work resulting in defeets and requiring rework, ete) and physical or 
thermodynamic entropy (i.e. mixed-up-ed-ness, disorder, disorganization, ete), 
which is the quantity of energy not available to do work. Logical entropy is 

Shannon's entropy ^ as defined by Shannon on his treatise on communication 
theory (Shannon 1948). Shannon’s theory says that the entropy of an information 
souree measures how well its behavior (e.g. the next symbol in a sequenee it 
produced) can be predicted. 

The definition of entropy here is related to the definition of entropy in 
thermodynamics. What follows is a basic review of entropy in information theory 
after Shannon (1948). Let Xbe a discrete random variable with alphabet 5"and a 
probability mass function p(x)=?x{X=x}, xeS. p(x) and p(y) refer to two 
different random variables and are in faet two different probability mass funetions 
p^(x) a.ndpy(y). 

The definition of information entropy is: 

Pi^) 


Sh is the entropy measured in bits, and the log is base 2. Logi will be 
assumed throughout unless otherwise noted. Lor example, the entropy of a fair 
coin toss is 1 bit. The eonvention of 0^ log 0^ ^0 is used, which comes from 
continuity since x log x ^0"^, as x ^0"^. The base of the log is two for the natural 
units of information entropy as developed by Shannon (Shannon 1948). The 
entropy is a function of the distribution of X. It does not depend on the actual 
values taken by the random variable X, but only on the probabilities. (Saboe, 
2001) 


Entropy Calculation Equations and Example: 

The formula used in this catenation is the following: 


( 1 . 1 ) 
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Where p(x) is the probability of term usage. The probability of term usage is the 
cumulative number of a single term’s instances up to the given time interval divided by 
the number of terms instances of all the terms up to the given time interval. The 
following two tables give an example of the calculation: 

Table B-1 

Sample Data 


Term 

1989 

1990 

1991 

A - # of instances 

2 

3 

5 

B - # of instances 

1 

5 

18 

Local sum 

3 

8 

23 

Cumulative sum 

3 

11 

34 


Table B-2 


Sample Entropy Calculation Example 


Term 

1989 

1990 

1991 

Entropy of A 

-(2/3) * log2(2/3) 

= 0.3900 

-(5/11) * log2(5/ll) 

= 0.5170 

-(10/34) * log 2 (10/34) 

= 0.5193 

Entropy of B 

-(l/3)*log2(l/3) 

= 0.5283 

-(6/11) *log 2 (6/11) 

= 0.4770 

-(24/34) * log 2 (24/34) 

= 0.3547 

Cumulative entropy 

0.9183 (a + b) 

0.9940 

0.8740 


Predicted Entropy Calculation 

The predicted entropy value for a time interval is calculated using the trend-line 
power equation (the least squares fit through points): 

y = cx^ (1.2) 

where c and b are constants. The time interval replaces x. An example from the 
Ada dataset follows in Table B3. 

Percent Error of Actual vs. Predicted 


Percent error of actual vs. predicted is calculated using the formula below and an 

example follows in Table B3. 

^ Predicted - Actual 
Error =- 

Actual n 3t 
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Table B-3 


Predicted Entropy Calculation and Error Example 


TimeT 

Slice 

Actual: 

Cum Entropy 

Error (Act vs. 
Pred) 

y = 

4.7404x^.1489 

Predicted: 

5 years of data 

1 

1979 

4.48385619 

5.71% 

4.48385619 

2 

1980 

5.406900167 

-2.86% 

5.406900167 

3 

1981 

5.805013635 

-3.93% 

5.805013635 

4 

1982 

6.057909749 

-3.94% 

6.057909749 

5 

1983 

6.082181413 

-1.11% 

6.082181413 

6 

1984 

6.106601538 

1.19% 

6.179377493 

7 

1985 

6.355700897 

-0.53% 

6.321976128 

8 

1986 

6.52682382 

-1.21% 

6.44815784 

9 

1987 

6.549095131 

0.19% 

6.561546835 

10 

1988 

6.611519798 

0.80% 

6.664665264 


Note: First 5 intervals under the predicted column are copied from actual. 


Time Interval Derivative Calculation 


Computing entropy from a process of transferring information, from using the 
Lyapunov exponent and lambda, is also described in Michael Saboe’s doctoral 
dissertation: 


We compute entropy two ways. One is from data, which is acquired 
experimentally. Using the cumulative approach seems appropriate since the 
messages are persistent and available to all of the future researchers to examine. 
The other is from a model of the process of transferring information. The 
experimental entropy data is related to the information we know about a topic. 
We refer to this as Shannon’s entropy (Sh)- The data Sh is gathered over time 
steps k. We perform regression on this data and have as a result a function that is 
of the power law form (e.g., y=mx^, where m is the slope and b is the intercept). 
We also have a model of a non-linear dynamical system. The Lyaponuv exponent 
of a map gives the sensitive dependence upon initial conditions that is 
characteristic of chaotic behavior. 

The eigenvalue of the Jacobian of the finite difference equations 
representing the dynamical system is also related to entropy. The Jacobian matrix 
is simply the derivative of a j!7-dimensional map function F. Saboe uses the form 
x„+j = F(x^, where x is a /»-dimensional vector. In fact, the relationship is 
through the Lyaponuv number. 

The Lyaponuv number and this relationship is defined as 
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Jn = [J{Xn) JiXn-l): JiXj)] 


where J(x) is the Jaeobian matrix of the map 

J{x) = {dF/3c) with ji(n)> j 2 (n)... > jp(n) are the magnitudes of the 
eigenvalues of Jn. The Lyaponuv numbers are 

Xi = limn^oo , i = 1,2, ...,p. 

The Lyaponuv exponent is the smallest, positive, real nth root taken. The 
Lyaponuv exponent is the log of the Lyaponuv number. 

The Du(T) and Du_(T-c) ealeulations for the “Entropy Lambda” sheet are based 
on the derivative of the trend-line’s equation from the eumulative entropy graph. The 
derivative of the trend-line’s equation is taken and then the time interval replaees x. The 
following is the equation used; 

—\y = cx* 1 = 

dx^ ^ (1.4) 


A usage example from the Ada dataset follows in Table B4. 
Table B-4 


Time Interval Derivative Caleulation Example 


TimeT 

Du (T) 

du (T-c) 

du (T-2) 

du (T-5) 

1 

0.70152 




2 

0.388653426 

0.70152 



3 

0.275126706 

0.388653426 

0.70152 


4 

0.215320284 

0.275126706 

0.388653426 


5 

0.178040011 

0.215320284 

0.275126706 


6 

0.152424645 

0.178040011 

0.215320284 

0.70152 

7 

0.133664638 

0.152424645 

0.178040011 

0.388653426 

8 

0.11929092 

0.133664638 

0.152424645 

0.275126706 

9 

0.107900992 

0.11929092 

0.133664638 

0.215320284 

10 

0.098637046 

0.107900992 

0.11929092 

0.178040011 

11 

0.090943882 

0.098637046 

0.107900992 

0.152424645 

12 

0.084445719 

0.090943882 

0.098637046 

0.133664638 

13 

0.078878805 

0.084445719 

0.090943882 

0.11929092 

14 

0.074052371 

0.078878805 

0.084445719 

0.107900992 

15 

0.069824897 

0.074052371 

0.078878805 

0.098637046 
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Note, y = 4.7404x°'^*‘^ 

du(T) = (4.740*0.148)*T^(0.148-1) 

du(T-c) = (4.740*0.148)*(T-c)^(0.148-l) 


Lambda Calculation 

The lambda calculation is dependent on the time interval derivative calculations. 
The equation to calculate lambda is: 

K=iPf'i^)T n o 


Where 


/'(x) 


is substituted with: 


du 




dt 


dx ^du^^du^ 
dt dt 


substituting for f'{x) we get. 


/^o — 


du 


,1/3 


P- 


(l-g) 

dt 


P- 


du, 


(i-g) 


du 


dt 


+ - 


(0 


dt ) 


( 1 . 6 ) 


(1.7) 


The values from the time interval derivative equation (1.4) are placed into (1.7) 
with varying P values (e.g. 0.1, 0.2, 0.5, 0.75). Table B5 shows an example of the lambda 
calculation. 


Table B-5 


Lambda Calculation Example 


TimeT 

Cum Entropy 

Du (T) 

C y 10% 

Lambda p 

10% y 

P 

10% y 

1 

4.48385619 

0.70152 
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2 

5.406900167 

0.388653426 

4.87216694 

0.534733226 

0.1 

3 

5.805013635 

0.275126706 

5.306648158 

0.498365477 

0.1 

4 

6.057909749 

0.215320284 

5.574025253 

0.483884497 

0.1 

5 

6.082181413 

0.178040011 

5.60612135 

0.476060063 

0.1 

6 

6.106601538 

0.152424645 

5.635448868 

0.47115267 

0.1 

7 

6.355700897 

0.133664638 

5.887915573 

0.467785324 

0.1 

8 

6.52682382 

0.11929092 

6.061493127 

0.465330693 

0.1 

9 

6.549095131 

0.107900992 

6.085633441 

0.46346169 

0.1 

10 

6.611519798 

0.098637046 

6.14952886 

0.461990938 

0.1 

11 

6.64290191 

0.090943882 

6.182098576 

0.460803334 

0.1 

12 

6.725985485 

0.084445719 

6.266161229 

0.459824255 

0.1 


Note. “C_y_10%” is found from “Cum Entropy” minus “Lambda_ p 10%_y” 


Lyapunov Exponent Calculation 

The Lyapunov exponent calculation depends on the trend-line equation from the 
map of entropy at time steps k and k+l. The derivative is taken the same way as in 
equation (1.4). Once the derivative is found the time interval is replaced for x. A usage 
example from the Ada dataset follows in Table B6. 


Table B-6 

Lyapunov Exponent Calculation Example 


TimeT 

Cumulative 
Entropy K 

Cum K-i-1 

Lyapunov Exp J’(k,k-i-l) = 
0.724*1.720 k^(0.724-l) 

1 

4.48385619 

5.406900167 

0.823021444 

2 

5.406900167 

5.805013635 

0.781579695 

3 

5.805013635 

6.057909749 

0.766403215 

4 

6.057909749 

6.082181413 

0.757435961 

5 

6.082181413 

6.106601538 

0.756600505 

6 

6.106601538 

6.355700897 

0.755764222 

7 

6.355700897 

6.52682382 

0.74747023 

8 

6.52682382 

6.549095131 

0.742009203 

9 

6.549095131 

6.611519798 

0.741311905 

10 

6.611519798 

6.64290191 

0.739373453 

11 

6.64290191 

6.725985485 

0.738407755 

12 

6.725985485 

6.817516503 

0.735878946 


Note. y= 1.7208x°-^^^' 

dx = 0.724* 1.720)K<°-’^^''^ 
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APPENDIX C - SABOE DISSERTATION EXCERPTS 


The following pages were taken from Dr. Michael Saboe’s doctoral dissertation 
and are referenced in this thesis’s body. 

Each of these subsets represents a possible that a researcher nia\ find tins 
mes-saee. Often, as we know, we use only elements of some research, that is single or 
double or more sets of terms Each of these are legitimate accessible states of the 
mes.sage The higher q-le\el temis can be \iewed as higher level concepts. You can see 
bv inspection that it is not possible using this approach to haie a q=3 tcmi without filling 
the lower level q-le\el states .At .some point the combination of a q=2 or q=3 set of terms 
can lake on meaning as a primitive temis in and of itself These higher q-level sets take 
on the meaning of a higher level of abstraction They can then be coasidered 
rcprcsenlatioas of a concept which rnav be mpbeed bv new single temis 

At that point, it becomes a q=l set We would expect that the higher lev el q sets 
will e.xhaust when thev become more and more frequentlv used. This seems to be 
consistent with the abstraction discussion (Whitehead l*:^IOK and learning models 
(Newell N80) (See 11 .Abstraction, p91, and I(J Learning Cuncs, pW and 

( hapter 111) Shannon (l‘MS) illustrated this using a telegraphy notation where, a birth or 
death vvas simply ^presented bv a few terms. The receiver understrxxJ that those few 
temis implied that a babv bov was bom on a certain date, and other appropriate debils 
Wc do the same thing when vve learn We follow an economy of svmlxils and the 
principle of lea.st effort (Zipf hM9) discussed earlier 

We could look at the entire message of the publicatum (the article or report), and 
vve could, in fact. Unik at everv temi in the |)ublication and determine the frequenev of 
iKCurrence of the set of sets of temis If we were hxiking at every temi in a message or 
report, vve could al.so populate the lower half of the matnx This would pemiit the 
dctemiIllation that |.AB[ vvas different from |B.A|. because (AB) has .A preceding B and 
the reverse is tme in the case of | BA | Tliis is how the analvsis would be done for a free 
text study 

For the jiurposes of this experiment, vve are using a bibliographic record, and onlv 
examining the desenptors In a descriptor field, vve would not expect the term to be 
entered more than once, and the order is not significant in the data source used in this 
study We do assume that the temis in the desenptor field are representative of the topics 

covered in a message Further, that the message terms in this field are .symbolic of the 

-67- 
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FLg'UTt: 111- lO Intnnctini^ A nnd D 

[n FiuLico ]II-J2*'^, wu lion iIiliI ai; s'vsLuni A chn nLunbor of EomiH 

dLKOo\'cix;d at lluj !;amo mlO ibal ihL; nuinhnr of El; rmii undij^covoroJ dL;-4:n;aKL;K. 

This irtodol sarisficK our consunatLon printL| tJc for oxttiiKivie quanliEits 

Ntisl, in Ft^uru Jll-l^, wt examine iht: untmpy [tLaLiomihLiT. Tht horizonlal lint 
nl [he lop of ihe fiflure is llit joinl enlrtspy of Iht sysltm. Sinct this is a citsstd system, 
[his is noi chaji^in^. hofvtvtr. iht iiilemnl diKlrihuLion ivill than^u. Thai tniJitpy rtlaltd 
Eo KubsyNltm A it ill incroast as iht aiu nuiro and moju -chuiooN lo make in onJtr to ^1 
tcjmplele informalion. fiuhsysiem G ivilL det-rtasu from a lii^ tnlntpy CatL of the 
unkmm n Etimsll to a hm tr tnLropv as [here btcomes loss and Its* left lo he diseovtred 

[lit LtiaiTi^ ut [hi;i iCLliert Icil1i:i 3 da1u ui iljuaruct dit ^ckiilI rekiLion.duiJiL AlcluI 
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The eu rv c ifhows ihe muliia] i nfomlalLon , When llic disLanec bclwuen the denlLir f)f 

the Live probability masKus. ur subfivKlenlS. dL:cnjiTSL;H, ihem; lh a hiijjidr ccsFrulaliun. 
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FiuLire 111-11 Subset of an alphabet in prtui intdraelinj; KYHlems !M arul ??? 


Messages in Two Subsystems 

Irti aclii j S^hns Aand B |'C>arubit ii T^lil Syatan AB| 



Figure Dl-J 2 Messages in Iwo suhsyslenis 


diiuijLiuu^ fera lia.:liruiki$y aje aLuws in Citiojitisr IV. anJ lheap^^dts. 

- JI7‘ 
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Entropy vs Messages 
Two Subsystems 


Enlropk' S In temc dng A nnd E 


&CK) A = -SE-OsV’ + 0jQ22Sn + d.SS3! 
= 0J92^ 



if ssr ^ ^ ^ ^ ^ jSS ^ 


n HassjgAi A 


Fisjucc I[[-I3 Enlrofjy vs. MtisHauui T'l^o inCL’raEtiny SysCt-ms 


FuJIu^vinu JEa^xjiimg Hiinibr L[) thjL us*JAj in .slziLi k ideal, and EnnclunHeJ paniele 
phj sicH (Sehnx.'der 20t)(inFraund(3Tff 2000), v<x- can find sume UHufnl ralalionships. The 
slopa af the enn'ES ef ihe twcj snbs^'sCams gives us Kumu inip>4.>Tlan[ inliiTmaLion aJicjul 
thunnal uciuiLihiriuni. RenaLI from ihu nannnieaJ unsumblu discuHsinn of free uneigy, dial 
[liu lempLiraLurt; 7'lh the [Taramuler e^mCnjlling free uneriiy, Or [he e^jnsurved [mipiertv. In 
[his easu of idUsse^^h, \vu can vvri[e 


T i\ii 


if 25) 


So Lhu [empettilure is celjilud U> slope of [he chanue in enlntpy verses change in 
messages curves. When the curves in the figun: Cross over, the systein is al an 


- ]l?t- 
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Librium [KiinC l^k ^ ^uncraL rglaliou^hip LbiaL lLi'L' inciua^u in Ltnu 

liystcm iH icLaruJ Lo iht; nu^Livie .Hlopc ur, Lbii; Jutrua;};; in ihu oLh4.‘r. 


A.y,_ AS-, 

.ij?j Aflj 


<J.26) 


Thu intrumunCaJ chan^- in dih'idud hy lLk; dian^i: in, JJ/ nnjKsnguSj is uqual tu 
thu chanuu in unlnupy, })ff. for s^stum ^ a^n ujmpjK'd lu LbL; chan^C- in ihu (;c7nHUJwd 
quanlily, in Ltiin casL': ;?j. Ruxviiliriu wu 


A.S-J ^ AXs 

Aflj 


271 


Thu suunnd liunri biaH a fi in |hu numuiator and in [hu dUnciiTiinatr^r. is thu 

samu as since tthal ue discover in mefisj^'s is Lhe same as what is remeved fnmi 
thu undiseuvuied st'iitum Wu can rewritu this fi^r a syslum at uqui Librium as 


A.y, _ Ay, 

Aflj -'^h 


o m 


Thu ihin^ that is Lite same for Itelh sysieins when ihew' are al thurmal UL|LiilibriuiTi 
IS the sltspe of thu unLrofiy Tnessa^g Jiraph. This sirjpe must somehotv bu related to the 
temperature of ihu sjsLum The 2*^ Law of thumiodynamics lells us tltai thu c^insuned 
property nvill tend lo flotv fRfO the subsvslem with Lhe entropy es. messa^ i^JtipIt. 

and OifI of thu obiect with die unliDpy vs message ijiaph (Schrtxsler 2tK10 pK7f 

Aceonlin^ to Schmedur, ttie fonnur ‘Wants to’’ j;ain thu free consereud properLy 
(messages} in older to increase its entropy If there is an intt^aLanee between the two 
subsystems, the Latter doesn’t ho much "‘mind" losing a (ehv messages |since thu entropy 
will not deereasu much. steep .slope must corruspond to a low lempuiature. while ^ 
shallow sLope eorresponds tum[>eratLire. 

Now we ean see in the ktwer uur\'u of Figure 111-14. the mlalionship of the 
temperatu re t the right tuind >’-a\is) of sub-ststem A as the partiliou moves csver the time 
stents. Mom activity i nerep'seH the tempumture. The tuniperatu re is ntuasumd in degrees 
as vve would in a physical system; however, these degrees are deveiinped frem 

- 1 |t.? - 
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inf»rniLiti(]n uiiitK. This is “tht:’' fun.damLnt£iJ Ujmptnilunj unit diivitltspud frim] ilic- 
reldliunKhip of Lintrupy, ajid thu conscrv'ud qunntiK 

N(jH; lhal ihcrc aii; tom pcrglun; fliKLualiiunji This is tansisLont wilh Prii:cj^nL;''s 
obiion atio]] about (.•'^'olvinj’ .nysloms A dynamicat systum \sil] liolp explain these 
flnotLiaLicm^. 

Pressure and Temperature 
vs timestep 

T«nip<irjrur4 jnd Pressure Tim»t«p 



Fijjuro 1M- J 4 Pressnix;. and Tom peialmi: ^Saboo^^J vs. Lmo — Livo interaclin^ 

K\sLeniH 


PnjsHuro is defined j.h ihe <m(,'s;sa^L's;' piotoHsed per nttde, tvhun; the "^miessa^os> 
nepreseni the aveiago in tho limo step pOr nirdu. The imptsrlanl observation is not 
noDos'sarily th-o form of the <x|Uiilions or the goodness of fil, rather, dial fhs pressure tlUfr 
be !:een Ut fncreuxe m tbe lempi^riiltire fttcremes. UHiile mossagos are noi ph>sica] 
nitdoculoK as in a themttidynaniic system, thov seem Lo hehavc as a g;is might, as the 
temporattire uoos up ihe prossure goes up 
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T^rr^aura iB] nugrcMij 

































Figure III-I5 shows the reiaticHiship directly between pa'ssurc and temperature. 
This was de\eloped b\ taking the curves from Figum III-14 and setting them both equals 
to k. Then the Pressure P(T/ as a function of temperature is determined 


/'(*)= flip 

(3 29) 

nk)-b, 

nip 

(3 30) 

Similarly, sedve for A as a function of T. 


T(k) = mjki-bj 

(3 31) 

T{k)-b, ^ 

(3.32) 

Then w e get 


P{k)-K nk)-b, ^ 
nip nir 

(333) 

/>{k)=!!^(nk)-b,)+bp 

(334) 


When plotted in Figure III-15 is the tight set of points indicating as temperature 
increases, pressure increases Figure 111-15 shows the raw data points as well. These 
fluctuate around the PiT) calculated data, would be expected 
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Ada Distribution of Messages by q-ieve! 

q lo'i'ol OfslTiburion 



1 2 3 4 S G T B P 1D 11 I3 13 14 1S ie IT IB IB £0 21 3T 13 74 

q>»>l ub torn, north.[ha 


Fi^n: ]ll- 18 Svt uf sets dislribuLion uvvr time steps by q-leve! 


3. Tcchn dJ [i|^v TrimsFer'Thnnnel Elements 

\Vu CociHider Euo easefi. The delermLciLHtic i^se repiesents tlie micrusetipie 
le\'el of the nieckl in ihe system, und the stuetuiHtic 4::aKe repiesents the m^tenuseeKpie 
svStem s'ie'i^ . Su far. sve bas e enly addressed the macroseopie ease-. The deLemuiiislie 
ease wuuld ee^ur ai the mieiD Jes'e] in a pno^mm,. nr a system mitde up ef nudes 
eensistini’ uf a family of machines. A stoetuHlie system eonsislS of a pupabLion. 
coarsely pactiLiionjed at die maercjseupic ieve]. This is a system made up uf a socia] 
envirunmenl eunsistin^ of people and un^ii LzjLictns The riL'£',^]7T models address Hie 
mure ^enerit case of die sLochastie system of nodes eonsisLinp of people, uruanizaliims 
and machines. 
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Wc define the communit>, the macro structure, as a set of performers that produce 
output An organizational is mode up of a set of the performers with which they are 
affiliated Wc can think of the micro level in terms of the performers The organizational 
level is in between the macro and micro levels and can be thought of as an ensemble of 
affiliated performers We can observe individual output from the data Each lecord 

contains primitive messages published by a performer, ^‘■.contributes information to the 
community. This is defined as follows 

r 

.T = [J .1, is the ammunity (3 36) 

*-i 

-V, where x^,x^„.x^ arc the perfi>rnKrs of the i* nryanizatiun 

(337) 

.V, is the oiganization. and i = l..p (3.38) 

The output entropv is allocated from the message to individual author subset 
performers from the empirical data This micro level is then summed up and allocated to 
the to the affiliated organizational level. The organizations are banded based on a 
distnbution of the cumulativ e number of publi.shed messages 

We consider a familv of nodes (machines, and people - the atomic level), making 
up organizations (the molecular level), and a communitv (macro level). In a band, we 
assume all of the nodes have the equivalent properties, i.c. each organizational node, 
comprised of performing author nodes, are statislicallv equivalent Figure 111-19 and 
Figure 111-20 illustrates a node taking information in as input S(X), performing some 
transformation. F(X0. to produce more mc'ssages (work products). Part of the output is 
expanding the mutual information I(X:Y} intersection of the Venn diagram, and part is 
augmenting the vocabulary, This augmentation is the conditional probability SfYX), as 
we saw from equation (3.15) 
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Node Input and Output 



Fi^uru III- 19 Input bi^in^ tonvC-rfejJ vin a IranKfcrfiinulion lu output 


Node Transform of Input to 

Output 



Fiyuni lt]-2() Nixlc Irailb-ftjm of ]ripuL tu OulpuL 


- J2*l- 

















The initial txind determination is computed based on the accumulation of 
experience of executing tasks, i c. publishing messages. The most prolific performers are 
banded together based on the average number of messages pnxluced over the period 
examined. Later the learning, or performance index is computed for each band at every 
time step from the beginning of the data set to the (currentl perfoimartce time step. An 
example of the distnbution is shown in Figure III-21, 

We will perform a coarse partitioning of the performing organizations into four 
bands Further, partitions am possible, however this is sufficient to demonstrate the 
approach. The “A" band consists of ail of the organizations that were beyond 3a in the 
rate of production of messages in the sample for a given technology. The “B” band arc 
the organizations in the 3a partition The “C" bond contains the organizations with a 
message production history in the 2a partition, and the "D" band are all of the 
organization below 2a in performance. 
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Productivity Distribution (Sample) 



Figure IIE-21 Oiganizatitsn DiHlribution into Ciimulati^'L Tasik PorftMmud Bajidsi 

Our pnjblein is Its rtsiltiu, or at liiujit to approxiinalu, a yivon HVHloin, uhich uo 
caJ] the tmo sWom, by a mudeJ Wo adjiiHl Llio pjcamoiorH vaJuuss ba.HoJ on a nuaiExjr of 
examploK ptovidod by obsurva[j(3n uf tlio true iry^stum. 

Thu analvHU^ of thu panilion<i. c^n prxjuuuJ L\ac[]\ aii diu analysl'i of Lho nijcro 
lovul oomuiLinily. ThiK ihu boauLy of the paitilioning. Wu only havo L? bo oauliou^ of 
combining bands tthon [ho mun[s of EunriK, (mullLplLcitv of .Mtalo;;) ato. "lcM:ar lo the band 
under o:\aiTiinalion. Wo otmnl inos;sugoH in a hand and dovoJop tJio pnihabi]]tioH, and 
honoo die untcupy of tbo band Lii based nn Llio lotaJ number af mossago^i in dto band. In 
tjtdor to agujugate band;;. \^'o oonsidor ibi;; onlropy tbo band s oontribulion to die total (all 
bands) ontn^y There is an enlrtipy ctinlribuLion simply resulting fn>m ihe piartition. 
This ijimlnbuLitjn varies every time slop basod un the internal organizalicm of llio 
messages.. -eunsUtuonl lonns, ;ind nodes. 
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Node Input and Partitioned Output 



I 

.S’ +L-diag. 

ii 


— 

LI, 


1^ JqI muftiplicity cf temjs in D Sand = T h e e lit ropjy C Jl ICLII ate cl 
C.jfs contnbotfon rj Band i '' locally for Band 


PiuLirc 111-22 Pirtilions ofu-ulpuL inUh bandji Conlribuliun to iho Conirnunity 

Earh hand, j, provides a conlrihmcion, C^ to iho conlmuiiity onlnopy. Thi; local 
band cntrOjiv .S'j^ , ma.Hl be scaled based on ibe mukiplicily il, of lorms in the band to llio 

nitilliplidty il of tonus in ibc tvorld The canimtiiiiLy. which is somclimcs referred to as 
the technology's ■^vo^]d” entropy Is ihe sum of Ihe coniribution.H 

□ ikia^r 

S„ = £ C, (3 JO) 

l-l 


^^hciu r.=-^.*iu 


iil 




(3.4fii) 


This relationship penniiH □giu.rcgution. of pnjvious rcsullH on a subset of a 
community ^^ith more iiiftjnnatlim later without having to remn the entire world and all 
prev'iouKly analyzed hands. .411 that is ix:4juiTud is the count of the instances of tenns in a 
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band nnd the euunl uflhij number uf in:iLni]cei; r^fiennH in Lhi:; □Li^n^eiiled hy thusie 

teim^. 

Lifer e\lenHionH Lc; bo oun^idemd hvtnjJd tiddrefin dl] of Lho ^'nrioLi.H cnmtnn^ioniT 
of author nodes prudueini^ a Fur o.'^amplo, lho porformOrs cimld bo 

ropnj.senLed as eombmalions uf julhoni produeing a record (^ehich as was ptnnlod Oui, is 
btKikcn down inio its primitive moHsa^cs al various q-luvels)-. AddiLionally^ wc could 
assume ibiaL if Lbeie are three julhuni on a leCurd, they ropieseoil 2 possible author 
subsets — nodes. Each subsoL is a Je^ilimatc eombiiiatLon producing the messa^os This 
dislribuLi(?n develops in osacLiy the same ivay as the term JistribuLion of seLs of sets aS 
developed. Tlio ahilily L[» oalcuLale ihg eonEiibution with a raiLio r»f l1ic ItKal svsteju 
instances to the mic'Ctislates of a lai]^er or smaJler system, it was ofL:]] useful to eouiU 
insLmees of stales, By eompulina the enLrO'py loeally, these chunks can bo eombined 
with other suhsvstems often with out additional eompLi1atH»n. 


- Jj2- 
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opporlunily lu rtljln; tin; tuu ™injpy miiaiiLinL.'*, Sm and Sa Kinci: Lhi:y am reialud to iho 
Sojnt; infnrrnalitm ivstom of cnusaa^^H W'u aitt dcalLnii i^ ilh thu samo infomiJliLin 
huncL! l]iu su ihin rtMiKucuibJt:. RacoJI is inlalrJ Id 

Lyapunov'& u^pran^nC X, uhinh comics frusm the ei^nvaLue f. 

Wn found Lho robtionjihip of muHiasjoH- ^rrio limo slop in Fignm II1-8 ^vas vor\' 
salisfajclc^rily modoLud as a LinOar equation for ihis leehnobtwy snE. I[[ muld be dtfloreni 
for olhur teohnolo^es^ this is tthy have dealt ivilh the ivlalionships in loims of 
fLinetiuns, eiuenvatues and duiivaEivus.) In t1iis na-sO. the dunvalive oF Lho Linear mt'Kle] 
lodueud to a constant in UtiLiaLion (jl.72) an noted uarlier. 

Nyw inslead of usiny an average, or ^ush for 0. it is cyntpuLd dimclly. To eontpule 
0, ihu amount of inlorniaLion that a node Oonsumes which persists in lime, hoth equations 

are a funclion of liniestanip. so ivu can solve =hyk ^ and 

— . for t Setting them both equal to eaich OlliCr, ive can solve far 0{SB.Sii) 


s\ 










(? 74J 
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zimJ 


/ . ^7J7J■ 


.4 


h 


= k 








Nr. 




I 

"s 




/ \ 

A 


% 

in, 


( 3.751 


(3 761 


(3.77) 


Htn; [he subscnpCs x lufura ty .nJope and inLereepC [erm^i tif [hi; Shannon onlrOpy 
equalLtm, and tho JinlKcriirt / is refiimn^ to similar [oimH in the balwrn [ranHform 
equation. 

Qofdn; do, IoI'.h o^pLcjEU tho mlalionFihip lo tompenaluri; frinn Ltio disi;relio, 
miem, modol. EaHiur, usin^ a mai:n;Hcopic appn^ach, ivo showed [hat tnmporaLnn: 
indi&a.Hos. or doi: neases widi inoreasin^ or Junmlisi nu pfi;^iSLn\: on a nodo mspeclivoly. [ii 
a pliysioal svHloan, wC oaji aJdrOsH lonipuraluci; in ofontropy and oonHcn-iod pnjpurty, lu[’s 
soo lhal this in: Inn: (dr this JisOmlo. micro (iiimLilaLion as wclJ. 

(n Fi^ro I]l-‘30., hvu sou on tho ]ufl-hand side, [hat thunu iH a [nnsfur funcliim lhal 
ctmvuits .1' in^Tul, or some pura;nU|5c- of tlio available puniisEunI in}™[, into output. 
ThiH i.H really made up lof two part.s as Huun on ihu ri^[. We can use a Venn dia^tam a.s 
intrcxlucud earlier, [hrlcnsive properties like messje^es are additive. Probabilities are 
niultiplieatii'e. This also applies to the entropy. 
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Java relationships 


Jm Tjtptcnd HI locH 




Jwa GHb Hid'MnMicre' 



Ph D Defense 2002 

Figure IV-15 Java Relationships 

For an cariy Ada example seen in Figure IV-16, we can observ e ihal both curv es, 
the curve for the Lyapunov exponent and Shannon's entnrpv have the same power law 
form By obsenation, vve sec both of the entropv measures as a function of time step. 
Sn„ the information theory entropv measure is on the left v'-ar/s, and the Su which comes 
from the eigenv alue of the micro control model (hence in the range of 0 to 1), is on the 
right hand ,v-a\is The scales were adjusted to easily see that both curv es arc of the same 
form In addition, vve can see for this early data set. that the Ji~ values are reasonable, at 
0.96K for system level entropy and 0.96 for the bakers transformation /. We can see that 
as the system entropy stabilizes, the eigenv alue of the fc'cdback control dy namical sy stem 
is ai.so .stabilizing. 

initially, to determine the fitrm of the functions, the average value fi^-10% was 
used. This was done by iterative guesses of a fixed fi. This approximation of fi was u.sed 
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E(.) HaliHfy ihu matroKttspii; lylL: csf change cnEiD^TH' THIh suggewEs Lhal hvc have the righi 
form trf Ltic dvnomical sv'sLem control model malehed lo die macrosic^ifiLC Kyslem mixlel. 
ThiH also suggcHls ihal the model does ajifiroximaije lliu obsened condilions 

Entropy and Dynamical System j 


EtjU op/ {SB} f{j. fi,J 



Figun: IV-16 Macro Equilibrium and Eigem'aluc/ SlabilizaEitin 

From ^3.77|, ^^hicl] develops ShanncMi enliop>' nmv in terms of Jt, vvhicb we Lno^v 
from (3.7fJ)i is a funcliou of 0. Al this piTinl Ji adjusted unlil the entrofiy (eigenvalue) 
of Ehe disercle mi^del malched the macroscr^pie enEmpy of the informalion ihc^uetic 
rntxlel. In each time step, the tolerance on the two melhods of com|TU[ing the entropy 
were malchc'd to within D. 1%. This is soen in Figure IV-17. The upper curves (the hvu 
are tjuperimpoHed) represent enlrchpy converging at the same limestamps for the system. 
The lower curve represents which changes over time. TTie secondary y'-a.\is. on the 
right givesas a [!iet\:entage 
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Introduction 


Purpose 

The purpose of this document is to specify the requirements of the initial version of the 
DataThermometer application. 

Document Conventions 

This document will give most of the requirements at a high-level of abstraction. The 
detailed requirements will be appended in the future. 

Intended Audience and Reading Suggestions 

This document is intended to be read by the software developers and the stakeholders 
discussed in DataThermometer’s Vision and Scope document section 4.1. 

Product Scope 

Please refer to the Vision and Seope doeument. 

References 


Title 

Author 

Version 

Date 

DataThermometer Requirements Elieitation 
Doeument 

Matthew J. Behnke 

N/A 

4/2/02 

DataThermometer Vision and Seope Doeument 

Matthew J. Behnke 

.5 

4/2/02 

DataThermometer Use Case Doeument 

Matthew J. Behnke 

.5 

4/2/02 


Overall Description 


Product Perspective 

DataThermometer is a new self-contained, stand-alone, application. 
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Product Functions 


Feature 

# 

Feature 

1. 

Imports data from Microsoft Excel spreadsheets. 

2. 

Stores the data in a relational database. 

3. 

All functionality is performed by the user through 
DataThermometer’s graphical user interface (GUI). 

4. 

User-selected data processing methods are applied to the data. 

5. 

The results of the data analysis are in the form of Microsoft Excel 
spreadsheets. 

6. 

DataThermometer keeps track of the data processing methods 
already ran on a dataset. 


User Classes and Characteristics 

See stakeholders in section 4.1 of the Vision and Scope document. 

Operating Environment 

DataThermometer must run in a Microsoft Windows 95+ environment and must be able 
to interact with Microsoft Office applications, Excel and Access version 91 +. 

DataThermometer’s minimum hardware requirements are a Pentium-Ill or equivalent 
with 256 megabytes of RAM and 100 megabytes of hard drive space. 

Design and Implementation Constraints 

The following are design and implementation constraints: 

Minimum Hardware Requirements: 

Intel Pentium III or equivalent, 256 megabytes of RAM, 100 megabytes of hard drive 
space. 

Language: 

Microsoft Visual Basic 

Data Storage: 

DataThermometer must store parsed data in a Microsoft Access relational database. 
Results of the DataThermometer’s analysis must be in the form of Microsoft Excel 
spreadsheets and charts. 

Maintenance: 

The finished product will be maintained by the development team. 

User Documentation 

User manuals and on-line help will be supplied as part of the initial release. 
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Assumptions and Dependencies 

See Vision and Scope document section 2.3. 


External Interface Requirements 


User Interfaces 

DataThermometer is intended to be used by technically skilled users. 

The user’s goals are to import data, perform data analysis, and review the results. 

The following items are always shown in the top tool bar of the window: 

• File - contains subcommands such as Open data file. Close data file. New Data 
File, and Exit program. 

• Import - contains subcommands to build an import filter and import data using 
the filter and a command to import directly from Excel. 

• Analysis - contains subcommands to perform data analysis methods on the data 

• Results - After a data analysis method has been performed on the dataset 
subcommands become available in this menu to view the results of each 
analysis. 

• Help - Selecting this command will open the help screens. 

Keyboard shortcuts will be implemented for each menu 

The window area is reserved for user input / selection of any details a sub command 

requires. 

The following figure represents a sample of what the user interface will look like: 
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Hardware Interfaces 


N/A 

Software Interfaces 

The interfaces between software components will be handled with Visual Basic functions 
to tie DataThermometer to a Microsoft Access 97 database. Visual Basic functions will 
also store data analysis results in Microsoft Excel 97+ spreadsheets. 

Communications Interfaces 

N/A 


System Features 

The functional requirements are shown below. To see the stimulus / response 
sequences see the use cases in the DataThermometer’s Use Case Document. 

DataThermometer Functional Requirements 


Table 1: User Interface 


1.1 

All functionality must be able to be performed by the user through 
DataThermometer’s graphical user interface (GUI). 

1.2 

The GUI shall contain high-level commands in the tool bar with 
subcommands underneath. 

1.2.1 

The high-level commands shall be contained under the following 
menu headers: File, Import, Analysis, Results, and Help. 

1.3 

DataThermometer must allow the user to cancel an operation 
during a state that requires user input. When an operation is canceled the 
program must return to an idle state. 


Table 2: High-Level Functions 


2.1 

Users must be able to perform operations on a data file. 

2.2 

Users must be allowed to import data from Microsoft Excel 
spreadsheets. 

2.3 

Users must be allowed to perform data analysis on the dataset. 

2.4 

Users must be able to view the results of the previous performed 
data analysis. 

2.5 

Users must be allowed to access DataThermometer’s help 
documents from within the application. 
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2.6 

Users must be allowed to exit DataThermometer. 

2.6.1 

DataThermometer must close any open data files before exiting. 


Table 3: Data File 


3.1 

DataThermometer must store information about a dataset in a 
DataThermometer Data File. 

3.2 

Users must be allowed to open an existing data file. 

3.3 

Users must be allowed to close an open data file. 

3.4 

Users must be allowed to create a new data file. 

3.4.1 

DataThermometer must prompt the user to name the datafile. 

3.4.2 

A newly created DataThermometer Data File must have the 
extension “.dt” 

3.4.3 

DataThermometer must store the name of the data set. The user is 
prompted for the name of the dataset. 

3.4.4 

DataThermometer must create the Access database used for the 
dataset when a new datafile is created. 

3.4.5 

DataThermometer must set the name of the database to: <the 
name of DataThermometer’s datafile>-db.mdb 

3.4.6 

DataThermometer must store the name of the database in the data 
file. 

3.5 

DataThermometer must store an identifier telling which analysis 
functions have been ran on a dataset in the datafile. 

3.5.1 

DataThermometer must store the names of the Microsoft Excel files 
that contain the results of the analysis functions that have been ran in the 
datafile. 

3.5.2 

DataThermometer must store the date that an analysis function ran 
on the dataset in the datafile. 


Table 4: Import Data 


4.1 

DataThermometer must be able to import data from Microsoft Excel 
spreadsheets. 

4.2 

DataThermometer must import data from Excel into a Microsoft 
Access database 

4.3 

DataThermometer must let the user select the sheets in an Excel 
spreadsheet to import into the Access database. The default is all sheets. 

4.4 

DataThermometer must allow the user to change the name of the 
imported Microsoft Access table. The default value is the name of the sheet 
in the Excel spreadsheet. 
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Table 5: Analysis 


5.1 

The user must be able to select from a list of data processing 
methods that can be applied to the dataset. 

5.2 

The results of the data analysis must be stored in the form of 
Microsoft Excel spreadsheets and charts. 

5.3 

Analysis must be based on the calculation methods previously 
implemented. 


Table 6: Results 


6.1 

DataThermometer must allow the users to view the results of a data 
analysis function if that function has been ran on the dataset. 




Other Nonfunctional Requirements 


Performance Requirements 

N/A, TBD 

Safety Requirements 

N/A, TBD 

Security Requirements 

N/A, TBD 

Software Quality Attributes 

Correctness - The methods implemented need to be correct in order to provide a useful 
product. 

Maintainability - The software needs to be written in a way that will support 
maintainability and evolution of the product. 

Reliability - DataThermometer, like any application, needs to be reliable. Meaning data 
analysis methods will be able to process to their full extent without error. 

Usability - The application needs to apply to a wide domain of data. If this can be 
accomplished then one of the main goals will be satisfied. 
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Appendix A (of SRS): Glossary 

Dataset - a collection of data. 

DataThermomemter Data File - File used to store information about a dataset (e.g., 
name of dataset, whether or not data has been imported, type of analysis methods ran 
on the dataset, etc.). 

Import Filter - A set of rules that govern how DataThermometer parses a data file when 
importing data. 
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APPENDIX E - USE CASE DETAILS 


Initial Use Cases 


for 

DataThermometer 


Version 0.5 approved 


Prepared by Matthew J. Behnke 


United States Army 
Tank Automotive and Armaments Command 


June 2002 


Revision History 


Name 

Date 

Reason For Changes 

Version 

Matt Behnke 
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0.5 
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Use Case ID; 1 


Use Case 
Name; 

Open Data File 

Created By: 

Matt Behnke 

Last Updated By: 

Matt Behnke 

Date Created; 

6/16/2002 

Date Last 

6/16/2002 



Updated; 



Aetor; 

User, DataThermometer, Data file 

Deseription; 

DataThermometer prompts the user to select a data file. 

Preeonditions; 

DataThermometer must be loaded and in an idle state. 

Postconditions; 

Data file loaded. 

Normal Course of 
Events: 

1. The user selects the Open Data Eile command. 

2. DataThermometer checks to see if a data file is already 
loaded. 

3. A file is loaded. DataThermometer asks the user if it 
should close the file. 

4. User responds “Yes” to close the file. 

5. DataThermometer closes the file. 

6. DataThermometer displays directory tree. 

7. User navigates directory tree. 

8. User selects a file. 

9. User confirms file to be loaded. 

10. DataThermometer successfully reads the file 

Alternative Courses; 

3. A data file isn’t loaded. 

4. DataThermometer displays directory tree. 

5. User navigates directory tree. 

6. User selects a file. 

7. User confirms file to be loaded. 

8. DataThermometer successfully reads the file 


6. User doesn’t find desired file. 

7. User cancels Open Data Eile 


8. DataThermometer can not read file. Gives an error. 

Exceptions; 

TBD 

Includes; 

Close Data Eile (Use Case ID; 2) 


Assumptions: _ 

Notes and Issues: Exeeptions will be determined at a later date. 
Requirements: 1.1, 1.3, 2.1, 3.1, 3.2_ 
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Use Case ID; 


Use Case 
Name; 


Created By; 


Date Created; 


Close Data File 


Matt Behnke 


6/16/2002 


Last Updated By; Matt Behnke 


6/16/2002 


Aetor; 


Deseription; 


Preeonditions; 


Posteonditions; 


Normal Course of 
Events; 


User, DataThermometer, Data file 


DataThermometer eloses the data file. 


DataThermometer must be started, be in an idle state, and a 
data file must be loaded. 


Data file is closed. 


1. User seleets Close Data File 

2. DataThermometer eloses the data file. 


Alternative Courses; 


Exeeptions; 


Ineludes; 


Assumptions; 


Notes and Issues; Exeeptions will be determined at a later date. 


_Requirements; 1.1, 2.1, 3.1, 3.3 


Use Case ID; 

3 

Use Case 

New Data File 



Name; 




Created By; 

Matt Behnke 

East Updated By; 

Matt Behnke 

Date Created; 

6/16/2002 

Date East 
Updated; 

6/16/2002 


Aetor; 

User, DataThermometer, Data file 

Deseription; 

DataThermometer ereates a new data file. 

Preeonditions; 

DataThermometer must be loaded and in an idle state. 

Posteonditions; 

Data file loaded. 

Normal Course of 

1. The user seleets the Open Data File eommand. 

Events; 

2. DataThermometer eheeks to see if a data file is already 
loaded. 

3. A file is loaded. DataThermometer asks the user if it 
should elose the file. 

4. User responds “Yes” to elose the file. 

5. DataThermometer eloses the file and prompts for the 
new data file’s name. 

6. DataThermometer displays direetory tree. 

7. User navigates direetory tree. 

8. User seleets a folder and enters a file name for the data 
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file. 

9. DataThermometer creates the data file. 

10. DataThermometer tells the user to enter the name of 
the dataset. 

11. User gives a name for the dataset 

12. DataThermometer stores the data set’s name in the 
data file. 

13. DataThermometer creates the Access database for the 

_ dataset, named: the name of the datafile + “-db.mdb” 

Alternative Courses; 3. A data file isn’t loaded. 

4. DataThermometer prompts for the new data file’s 
name. 

5. User navigates directory tree. 

6. User selects a folder and enters a file name for the data 
file. 

7. DataThermometer creates the data file. 

8. DataThermometer tells the user to enter the name of 
the dataset. 

9. User gives a name for the dataset 

10. DataThermometer stores the data set’s name in the 
data file. 

11. DataThermometer creates the Access database for the 
dataset, named; the name of the datafile + “-db.mdb” 


9. & 7. DataThermometer can’t create data file. Gives an 
error. 


11. User doesn’t enter a name for the dataset. (Just hits 
enter) 

12. DataThermometer sets dataset name to equal the data 
file’s filename 

13. DataThermometer creates the Access database for the 

_ dataset, named; the name of the datafile + “-db.mdb” 

_ Exceptions; TBD _ 

_ Includes; N/A _ 

Assumptions; _ 

Notes and Issues; Exceptions will be determined at a later date. 

Requirements; 1.1, 1.3, 2.1, 3.1, 3.4, 3.4.1-3.4.6_ 
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Use Case ID: 4_ 

Use Case Exit DataThermometer 

_ Name: ___ 

Created By: Matt Behnke _ Last Updated By: Matt Behnke 

Date Created: 6/16/2002 Date Last 6/16/2002 

_Updated:_ 


Aetor: 

User, DataThermometer, Data file 

Deseription: 

DataThermometer shuts down. 

Preeonditions: 

DataThermometer is running and idle. 

Posteonditions: 

DataThermometer stops. 

Normal Course of 

1. User seleets Exit DataThermometer eommand. 

Events: 

2. DataThermometer prompts, “Are you sure you want to 
exit?” 

3. User confirms, “Yes”. 

4. DateThermometer checks if a data file is open. 

5. A data file is open 

6. DataThermometer closes the file. 

7. DataThermometer shuts down. 

Alternative Courses: 

5. No data file open. 

6. DataThermometer shuts down. 


3. User doesn’t confirm, “No” 

4. DataThermometer returns to idle. 

Exeeptions: 

TBD 

Ineludes: 

Close Eile (Use Case ID: 2) 


Speeial Requirements: _ 

_ Assumptions: _ 

Notes and Issues: Exeeptions will be determined at a later date. 
_Requirements: 1.1, 1.3, 2.6, 2.6.1_ 


Use Case ID: 

5 



Use Case 
Name: 

Import Data from Excel 



Created By: 

Matt Behnke 

Last Updated By: 

Matt Behnke 

Date Created: 

6/16/2002 

Date Last 
Updated: 

6/16/2002 


Actor: 

DataThermometer, User, Microsoft Excel Eile 

Description: 

Imports data from Excel into an empty data file. Stores the 
data in an Microsoft Access database. An example of the 
import screen can be found below in Eigure 1. 

Preconditions: 

DataThermometer has been started, a new data file has been 
loaded. 























































Postconditions; 

Data is added to the new data file. 

Normal Course of 
Events; 

1. User selects Import Data from Excel. 

2. DataThermometer makes sure a new file has been 
created. 

3. A new data file has been created. 

4. DataThermometer prompts for the Excel file name. 

5. User selects file. 

6. DataThermometer asks which sheets in the Excel file 
to choose and allows changes to be made to the name 
of the imported table. (See Eigure 1 below) 

7. User places a check next to the sheet names to be 
imported and makes any changes to the table names. 

8. User selects the import command. 

9. DataThermometer imports the selected worksheets 
from the Excel file to the Access database. 

Alternative Courses; 

3. A new data file hasn’t been created. 

4. DataThermometer tells the user to create a new data 
file in order to use this function. 

5. User cancels the selection of the Excel file. 

6. DataThermometer stops the import command and 
returns to an idle state. 

8. User selects the cancel import command. 

9. DataThermometer stops the import command and 
returns to an idle state. 

Exceptions; 

TBD 

Includes; 


Special Requirements; 


Assumptions; 


Notes and Issues; 

Exceptions will be determined at a later date. 

Requirements; 

1.1, 1.3, 2.2, 4.1, 4.2, 4.3, 4.4 

DataThermometer 

Import from Excel 

Please select sheets to Import and make 
any changes to the imported table name. 

Import? Excel Sheet Access Table 

□ Sheetl Records 

□ Sheet? Descriptors 

□ Sheets Sheets 

1 Import 1 1 Cancel I 

Fiqure 1. Example import screen. 
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Use Case ID; 

6 



Use Case 
Name; 

Perform Data Analysis 



Created By; 

Matt Behnke 

East Updated By; 

Matt Behnke 

Date Created; 

6/16/2002 

Date East 
Updated; 

6/16/2002 


Actor; 


Description; 


Preconditions; 


Postconditions; 


Normal Course of 
Events; 


Alternative Courses; 


Exceptions; 


Ineludes; 


Speeial Requirements; 


_Assumptions; 


Notes and Issues; 


Requirements; 


DataThermometer, User, Data file 


The user selects a data processing method to be ran on the 
data file’s data stored in it’s database. During this proeess the 
data is manipulated and fed through ealculations. The results 
of the calculations are stored in an Excel spreadsheet. 
DataThermometer stores which data analysis methods have 
been run on the data file and the date in whieh it ran. 


DataThermometer must be running and be in an idle state. A 
DataThermometer data file must be loaded and eontain data. 


Data analysis results are ereated in a Microsoft Excel 
spreadsheet. A link to the spreadsheet is stored in 
DataThermometer’s data file. 


1. User seleets Perform Data Analysis 

2. DataThermometer prompts the user to seleet the data 
analysis funetion that will be performed. 

(funetions map to analysis funetions from previous 
interations). 

3. DataThermometer obtains the data it needs from the 
Aeeess database. 

4. DataThermometer proeesses the data. 

5. DataThermometer ereates an Exeel file and stores the 
results in it. 

6. DataThermometer ereates a link to the location of the 
resulting Exeel file. 

7. DataThermometer stores the faet that the seleeted data 
analysis method has been ran on the dataset, along 
with the date it ran. 



Exceptions will be determined at a later date. 


1.1, 1.3,2.3, 3.5, 3.5.1, 3.5.2, 5.1, 5.2 
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Use Case ID; 

7 



Use Case 
Name; 

View Data Analysis Result 

;s 


Created By; 

Matt Behnke 

East Updated By; 

Matt Behnke 

Date Created; 

6/16/2002 

Date East 
Updated; 

6/16/2002 


Actor; DataThermometer, User, Excel file containing the results of 

_ the selected function. _ 

Description; After a data analysis function has been performed a link to the 
spreadsheet file with the results is stored. When users want to 
see the results of a particular analysis they select this 

_ command. This command opens the Excel file. _ 

Preconditions; DataThermometer must be running and be in an idle state and 

_ the data analysis function must have been ran. _ 

_ Postconditions; If it exists, the Excel file with the analysis results is opened. 

Normal Course of 1. User selects View Results of Data Analysis. 

Events; 2. DataThermometer checks to see if any data analysis 

has been performed. 

3. Analysis has been performed. 

4. DataThermometer. 

5. DataThermometer attempts to open the Excel file with 
the results based upon the stored file name. 

_ 6. The Excel file is opened. _ 

Alternative Courses; 3. The function hasn’t been performed. 

4. DataThermometer tells the user to run Data Analysis 
Eunetion xxx before trying to view the results. 

_ 5. The Exeel file ean’t be found. _ 

Exceptions; TBD 

_ Includes; _ 

Special Requirements; _ 

_ Assumptions; _ 

Notes and Issues; Exceptions will be determined at a later date. _ 

_Requirements; 1.1, 1.3, 2.4, 6.1_ 


117 






































Use Case ID; 

8 

Use Case 
Name; 

Open Help Doeumentation 

Created By; 

Matt Behnke 

Last Updated By; 

Matt Behnke 

Date Created; 

6/16/2002 

Date Last 
Updated; 

6/16/2002 


Actor; 

DataThermometer, User 

Description; 

DataThermometer’s help screen is opened. 

Preconditions; 

DataThermometer must be running and in an idle state. 

Postconditions; 

Help screen is opened. 

Normal Course of 

1. User selects Open Help Documentation. 

Events; 

2. DataThermometer displays the help documentation. 

Alternative Courses; 


Exceptions; 

TBD 

Includes; 


Special Requirements; 


Assumptions; 


Notes and Issues; 

Exceptions will be determined at a later date. 

Requirements; 

1.1, 2.5 
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APPENDIX F - INSPEC DATA FIELDS 



INSPEC records are divided into the following fields, listed in alphabetic order. 

Highlighted fields are 




AA 



MN 


AB 



Nl 


Al 



OP 


AN 



PA 


AU 



PD 


AV 



PF 


cc 



PI 


CD 



PR 


Cl 



PY 


CL 



RF 


CO 



RN 


CP 



RT 


cs 


(*) 

SC 

(*) 

DE 



SF 


DN 


(*) 

SK 


DOI 


(*) 

SO 


DS 



ST 

(*) 

DU 


(*) 

SU 

(DE and ID) 

ED 



Tl 


IB 



TL 


ID 



TR 


IS 



UD 


LA 



UR 

(*) 

MD 






(*) This field is for display only; you cannot search in this field. 
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APPENDIX G - ACCESSION NUMBERS & DATES 


File 2 (1969 to present) 


Year 

Accession Nnmbers 

1969 

00000001-00077999 

1970 

00078000-00203427 

1971 

00203428-00329824 

1972 

00329825-00462218 

1973 

00462219-00581540 

1974 

00581541-00703915 

1975 

00703916-00839869 

1976 

00839870-00990949 

1977 

00990950-01126133 

1978 

01126134-01276268 

1979 

01276269-01432493 

1980 

01432494-01604845 

1981 

01604846-01772464 

1982 

01772465-01959518 

1983 

01959519-02153708 

1984 

02153709-02349062 

1985 

02349063-02556647 

1986 

02556648-02774876 

1987 

02774877-03021189 

1988 

03021190-03259711 

1989 

03259712-03506594 

1990 

03506595-03764942 

1991 

03764943-04024754 

1992 

04024755-04315714 

1993 

04315715-04559711 

1994 

04559712-04840653 

1995 

04840654-05145636 

1996 

05145637-05457178 

1997 

05457179-05787740 

1998 

05787741-06129398 

1999 

06129399-06483807 

2000 

06483808-06796501 
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Entropy S,.,(X,V9 (output) Messages N,,, (output) 


APPENDIX H -AFFILIATION DISTRIBUTION MACRO OUTPUT 

CHARTS 


World Publications Map - 
Output vs Input 



5 - 

4 - 

3 - 

2 - 

1 - 

0 - 
0 


World Entropy Map - Sk^i{X,Y),Sk(X,Y) 
Output vs Input 
Ada (month) 



—•— Mean Entropy S_k+1 vs S_k 
-y=x 


3 4 5 6 

Entropy S i, (X) (Input) 
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Pressure (A) Messages per node 


Pressure vs Temperature Saboe Degrees 



Temperature vs Timestep 



35000 

30000 

25000 

20000 

15000 

10000 

5000 

0 

-5000 

-10000 
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Temperature (B) (Saboe Degrees) 









Temperature (A) (Saboe Degrees 


Temperature vs Timestep 



12000 j 

—•— Messs^es per Node (X) 

—•—Temp Saboe_Degrees 
—•—Templrom Power FitI 

-Linear (Temp from Power FitI) 

-Linear (Messsages per Node (X)) 

-Linear (Temp_Saboe_Degrees) 


8000 - 


6000 - 


4000 - 


2000 




y = 


115.78X- 


35000 

30000 

25000 

20000 

15000 

10000 

5000 

0 


N ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ 


k Timestep (months) 


Interacting Systems A and B (Constant Messages in Total System AB) 


2nnnn 
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Temperature (B) (Saboe Degrees) 












Entropy 2 Interacting Systems T_S (A) 

T(X) = -380.97S(X)^ + 4372S(X) - 9661.9 
=0.4993 

S(X) Entropy 


0 1000 2000 3000 4000 5000 6000 



Entropy 2 Interacting Systems A and B 



n Messages _ A 
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T(X) Temperature (Degrees Saboe) 

















2.00 

1.80 

1.60 


Learning Curve - ABCD_Band (Mean and Capacity) 
Ada (month) 


Q. 1.40 

03 

W 

0) 

I 1.20 

= 1.00 

0) 

Q. 

J2 0.80 

(A 

re 

H 

^ 0.60 
0.40 

0.20 

0.00 


+3 a Best Expected Ave 'I' Production Tasks per Time Step 



8.00 10.00 12.00 
Cumulative Production of Tasks (x) 


O A_Band_Mean 
- - Calculated Y 

X 3 Sigma 3 Sigma 
o B_Band_Mean 
A C_Band_Mean 
• D_Band_Mean 

-A_Band_Kappa 

- A_Band_+3Sigma_Capaclty_Kappa 

-A_Band Kappa/2 

-r-p 


r=J5:(x+p)/(x+r+p) 
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r= 1.2971 

p= 0.2286 

R2= -0.116 
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A_Band Entropy Map - Sk+i(X,Y),Sk(X,Y) 
Output vs Input 
Ada (month) 


9 
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9 
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Entropy (output) Messages (output) 


B_Band Publications Map — N|(+t,N| 
Output vs Input 
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B_Band Entropy Map - S|„„d(X,Y)k.„ S„<„id(X,Y), 
Ada (month) 
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C_Band Entropy Map - S^,^■^{X,Y),S^,{X,y) 
Output vs Input 
Ada (month) 


10 4 




- Mean Entropy S_k+1 vs S_k 
y=x 


Entropy (Input) 
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? 9 

a 

3 

O 8 


0 4-r 

0 1 
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y=x 
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Entropy (output) Messages (output) 


3000 n 


D_Band Publications Map — N|(+t,N|( 
Output vs Input 
Ada (month) 
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D_Band Entropy Map - S|„„d(X,Y)k.„ S„<„id(X,Y), 
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World Productivity In Pubs (Cumulative over k) 
Ada (month) 



World Productivity Index (Cumulative over k) 
Ada (month) 

2.00 n- 
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Learning Curve - World (Mean and Capacity) 
Ada (month) 


+2 c Best Ave T Production Tasks per Time Step 


® 2.00 

M 

0) 

E 


<-P-> 

r-P 


8.00 10.00 12.00 14.00 

Cumulative Production of Tasks (x) 


O Band_Mean 
- ^ - Calculated Y 

- Kappa 

— 0 — 3 Sigma 3 Sigma 

-Kapp/2 

-r-p 


W=K(x-*-p)/(x+r*p) 
K= -0.0297 
r= 30.1475 
p= -7.5538 

R2= 0.9869 
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APPENDIX I - ENTROPY LAMBDA MACRO OUTPUT CHARTS 


Entropy Lamba Macro Outputs 


Correlate S Lambda sheet; 


m_s 0.141 mjambda 0.001 

b_s 4.7841 bjambda 0.995 


TimeStep (k) 

month/year 

Entropy (S) 

Lambda 

Entropy(Lambda_ 

delta lambda 

Lambda from beta goal Beta goal to min R''2 

2 

12/1/1980 

5.275314635 

0.994245629 

5.251483194 

0.000567938 

0.995657995 

0.132905382 

3 

12/1/1981 

5.538028259 

0.995924291 

5.517431891 

0.00042421 

0.996006903 

0.10208582 

4 

12/1/1982 

6.067531972 

0.996810999 

6.098500836 

0.000959071 

0.996714464 

0.097042997 

5 

12/1/1983 

6.178802542 

0.997364327 

6.198915371 

0.000404526 

0.996829915 

0.083052859 

6 

12/1/1984 

6.135650861 

0.99774469 

6.162750561 

0.000734394 

0.99678855 

0.0700925 

7 

12/1/1985 

6.221374187 

0.998023296 

6.252729979 

0.000983186 

0.996891026 

0.063436288 

8 

12/1/1986 

6.379739802 

0.998236756 

6.399690706 

0.000398039 

0.99705529 

0.059736593 

9 

12/1/1987 

6.530833163 

0.998405885 

6.553265561 

0.000503212 

0.997222992 

0.057268005 

10 

12/1/1988 

6.533517799 

0.998543422 

6.560591028 

0.00073296 

0.997230893 

0.052462711 

11 

12/1/1989 

6.652679089 

0.998657616 

6.67988532 

0.000740179 

0.99735835 

0.050683887 

12 

12/1/1990 

6.652679089 

0.998754051 

6.680920936 

0.000797602 

0.997359446 

0.047053378 

13 

12/1/1991 

6.734791162 

0.998836647 

6.762053036 

0.00074321 

0.997444832 

0.045402533 

14 

12/1/1992 

6.848164916 

0.99890824 

6.874037342 

0.000669382 

0.997561031 

0.044642434 

15 

12/1/1993 

6.906260151 

0.998970934 

6.930567623 

0.000590853 

0.997618977 

0.043102516 

16 

12/1/1994 

7.037288817 

0.999026323 

7.060339643 

0.000531341 

0.997750243 

0.04316863 

17 

12/1/1995 

7.186357395 

0.99907564 

7.208449878 

0.000488078 

0.997897162 

0.043854029 

18 

12/1/1996 

7.321492306 

0.999119851 

7.342667513 

0.000448389 

0.998027734 

0.044528711 

19 

12/1/1997 

7.366025475 

0.999159727 

7.386051569 

0.000401044 

0.998069433 

0.043429703 

20 

12/1/1998 

7.423671507 

0.999195888 

7.454260879 

0.00093571 

0.998134504 

0.0430129 

21 

12/1/1999 

7.496781098 

0.999228842 

7.525898257 

0.000847809 

0.998202213 

0.042806722 

CM 

CM 

12/1/2000 

7.535631281 

0.999259005 

7.562510818 

0.00072251 

0.99823657 

0.041934104 


Entropy Lambda sheet; 

y = 212.96x- y= y = 

497.75 4.7841x0.1419 0.995x0.0015 

R2 = 0.989 R2 = 0.9797 R2 = 0.9532 


TimeStep (k) 

1 

month/year 

12/1/1979 

Records (cumu 
11 

Entropy (S) 
4.771928095 

S(k+1) 

5.275314635 

du_(t-c) 

du_(t) 

B_y_10% 

Lambda_B10% 

2 

12/1/1980 

56 

5.275314635 

5.538028259 

212.96 

0.371907208 

4.281069006 

0.994245629 

3 

12/1/1981 

179 

5.538028259 

6.067531972 

212.96 

0.262525929 

4.542103968 

0.995924291 

4 

12/1/1982 

321 

6.067531972 

6.178802542 

212.96 

0.205045305 

5.070720974 

0.996810999 

5 

12/1/1983 

505 

6.178802542 

6.135650861 

212.96 

0.169279407 

5.181438215 

0.997364327 

6 

12/1/1984 

824 

6.135650861 

6.221374187 

212.96 

0.144739623 

5.137906171 

0.99774469 

7 

12/1/1985 

910 

6.221374187 

6.379739802 

212.96 

0.126788582 

5.223350891 

0.998023296 

8 

12/1/1986 

1235 

6.379739802 

6.530833163 

212.96 

0.113048567 

5.381503046 

0.998236756 

9 

12/1/1987 

1361 

6.530833163 

6.533517799 

212.96 

0.102170389 

5.532427278 

0.998405885 

10 

12/1/1988 

1434 

6.533517799 

6.652679089 

212.96 

0.093329591 

5.534974377 

0.998543422 

11 

12/1/1989 

1873 

6.652679089 

6.652679089 

212.96 

0.085992989 

5.654021473 

0.998657616 

12 

12/1/1990 

1873 

6.652679089 

6.734791162 

212.96 

0.07979996 

5.653925038 

0.998754051 

13 

12/1/1991 

1991 

6.734791162 

6.848164916 

212.96 

0.074497555 

5.735954515 

0.998836647 

14 

12/1/1992 

2374 

6.848164916 

6.906260151 

212.96 

0.06990293 

5.849256676 

0.99890824 

15 

12/1/1993 

2478 

6.906260151 

7.037288817 

212.96 

0.065880513 

5.907289217 

0.998970934 

16 

12/1/1994 

2875 

7.037288817 

7.186357395 

212.96 

0.062327585 

6.038262494 

0.999026323 

17 

12/1/1995 

3196 

7.186357395 

7.321492306 

212.96 

0.059164846 

6.187281755 

0.99907564 

18 

12/1/1996 

3529 

7.321492306 

7.366025475 

212.96 

0.056330069 

6.322372455 

0.999119851 

19 

12/1/1997 

3746 

7.366025475 

7.423671507 

212.96 

0.053773712 

6.366865748 

0.999159727 

20 

12/1/1998 

3899 

7.423671507 

7.496781098 

212.96 

0.051455831 

6.424475619 

0.999195888 

21 

12/1/1999 

4064 

7.496781098 

7.535631281 

212.96 

0.049343845 

6.497552256 

0.999228842 

22 

12/1/2000 

4195 

7.535631281 

0 

212.96 

0.047410909 

6.536372275 

0.999259005 


beta / authors 


Mindshare 

0.001399004 
0.000302925 
0.000167028 
9.66855E-05 
5.11251E-05 
4.06904E-05 
2.80322E-05 
2.44317E-05 
2.10356E-05 
1.55807E-05 
1.44646E-05 
1.31259E-05 
1.06241E-05 
9.80717E-06 
8.39857E-06 
7.59903E-06 
6.91547E-06 
6.29325E-06 
5.94183E-06 
5.62432E-06 
5.29204E-06 


10% 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 

0.1 
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Entropy (k+1) Messages 


Ada Messages vs. Time 


♦ Messages (Cum) 

- - Linear (Messages (Cum)) 




^ 1y = 212.96x-497.75| 
I = 0.989 I 


0 4 - 4 — 


k Timestep (years) 


7 - 


4.5 4 


Entropy Discrete Time Map (Ada) 
Map Entropy K, K+1 


y = 0.8599x + 1.0476 
= 0.9729 


♦ Entropy Map S_k+1, S_k 
- - Linear (Entropy Map S_k+1, S_k) 


Entropy (K) 
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Entropy (S^ 


Entropy (SB) f(j, B) 
Entropy and Lambda 






A A ^ 

A_4 - - ■- 

= 0.9797 

A 

_ - -A"" A ' 

— - "A 

ff-'* . « 



♦ , 

♦ • 

1 1 

» ♦ 

- 1 - 


♦ , - - 



y = U.yybx''-““''' 

= 0.9532 

-- 






1 2 3 4 5 


A Entropy (Information SH) 

♦ j with B = 10% 

— - — • Power (Entropy (Information SH)) 

- - - Power (j with B = 10%) 


9 10 11 12 13 14 15 16 17 18 19 20 21 22 

k Timestep (years) 


Entropy and Beta_k 

Actual Entropy, Entropy as a function of lambda and beta 



k Timestep (years) 
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Beta percent of feedback 




















Beta per Author 


Author Instances and Mindshare 
Ada 
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Author Instances 






APPENDIX J - Q LEVEL ANALYSIS MACRO OUTPUT CHARTS 


9 ada_qlevels_term_month2.x(s 





c 

D 

E 

F 

G 

H 

1 

J 

K 


rr 

1 











— 

2 




Number of term instances in each q-Level per year 




3 

year 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


4 

1979 

0 

40 

86 

57 

35 

20 

10 

4 

1 

0 

0 


5 

1980 

01 

118 

141 

79 

44 

23 

10 

4 

1 

0 

0 


6 

1981 

0 

377 

365 

167 

81 

39 

16 

5 

1 

0 

0 


7 

1982 

0 

731 

791 

381 

188 

93 

43 

18 

7 

2 

0 


8 

1983 

0 

1184 

1321 

642 

314 

155 

75 

35 

16 

6 

1 


9 

1984 

0 

1935 

2009 

898 

392 

175 

77 

35 

16 

6 

1 


10 

1985 

0 

2150 

2237 

997 

430 

186 

79 

35 

16 

6 

1 


11 

1986 

0 

3165 

3637 

1707 

748 

324 

136 

58 

27 

12 

4 


12 

1987 

0 

3628 

4521 

2254 

1077 

521 

261 

139 

79 

43 

21 


13 

1988 

0 

3903 

5005 

2536 

1230 

601 

305 

163 

94 

53 

27 


1^ 

1989 

0 

5593 

8257 

4537 

2398 

1274 

710 

425 

281 

199 

143 


15 

1990 

0 

5593 

8257 

4537 

2398 

1274 

710 

425 

281 

199 

143 


16 

1991 

0 

6141 

9588 

5438 

2981 

1636 

926 

549 

349 

234 

158 


17 

1992 

0 

7839 

13067 

7602 

4206 

2265 

1223 

687 

420 

271 

176 


18 

1993 

0 

8278 

13881 

8081 

4453 

2374 

1263 

701 

424 

271 

176 


19 

1994 

0 

10112 

18229 

10992 

6311 

3527 

1981 

1161 

742 

509 

364 


20 

1995 

0 

11761 

22307 

13742 

8048 

4554 

2557 

1476 

914 

604 

417 


21 

1996 

0 

13618 

27600 

17511 

10625 

6257 

3657 

2174 

1360 

898 

615 


22 

1997 

0 

14862 

31098 

19982 

12286 

7314 

4298 

2546 

1573 

1019 

684 


23 

1998 

0 

15689 

33252 

21462 

13243 

7893 

4621 

2715 

1655 

1060 

703 


24 

1999 

0 

16654 

35907 

23317 

14463 

8640 

5043 

2937 

1764 

1108 

720 

_ 

25 

2000 

0 

17347 

37737 

24585 

15298 

9159 

5347 

3105 

1852 

1153 

743 


?R 














M < ► H / 25 Term summary \q summary year/ q summary monthlyJnstanc 0 1 <J| >11 


q-level Distribution 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 

q level 
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Entropy Term Instances 


Ada - number of term instances of all q over time 



k Timestep (months) 

Ada - Entropy of aii q over time 



k Timestep (months) 
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Temp 


Ada - T(x) 

after Weibull Curve Fit 



k Timestep (months) 
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APPENDIX K - TECH OASIS EXPORT SCRIPT 


' Macro: dataExport.tmf 
' Author: Matt Behnke 

' Description: This macro exports the data fields from Tech OASIS into MS Excel 
' along with a unique identifier so the data can be imported in to MS Access 
' where relationships can be maintained to allow further data manipulation 

' ***=i==i==i==i==i==i=]qj7XT VERSION—> Export directly to MS Access. 

Option Explicit 

Dim nStatus, i, j 

Dim exApp, strUniquelDField 

Dim strActiveDS, nRows, nCols 

Dim exRowCount, strUniqueValue, strView, strVal, NameLength 
Dim n, dataFieldl, dataField2, dataFieldS, dataFieldd, dataFieldS 
Dim nDataFields, booleanincremented 

nStatus = App.GetActiveDSName(strActiveDS) 

'get the field of the unique identifier 

msgbox(" Select unique identifier field ") 

nStatus = Dataset.PromptForField(strUniquelDField) 

'*Next version, ask the user the number of 
'*datafields to export... 


'prompt the user to select the datafields to export 

msgbox(" Select data field 1 ") 

nStatus = Dataset.PromptForField(dataFieldl) 

msgbox(" Select data field 2 ") 

nStatus = Dataset.PromptForField(dataField2) 

msgbox(" Select data field 3 ") 

nStatus = Dataset.PromptForField(dataField3) 

msgbox(" Select data field 4 ") 

nStatus = Dataset.PromptForField(dataField4) 

' msgbox(" Select data field 5 ") 

' nStatus = Dataset.PromptForField(dataField5) 


'Open Excel Workbook 

Set exApp = CreateObject("Excel.Application") 
exApp.Visible = True 
exApp. W orkbooks. Add 
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getData(dataFieldl) 

getData(dataField2) 

getData(dataField3) 

getData(dataField4) 

'getData(dataField5) 


Function getData(datafield) 


one 


'create a matrix in Tech oasis of the datafield on the left and the unique ID field 
'on the top.. Syntax: 

'View.CreateMatrix(leftfieldname, leftcontent, topfieldname, topcontent, 

' matrixtype, retumviewname) 


nStatus = 

View.CreateMatrix(dataField,"lJNGROUPED",strUniqueIDField,"lJNGROUPED","COOCCURENCE",St 
rView) 

'sort the matrix sort by the unique ID field. 
nStatus=Matrix.Sort("ROW",2,"DESCEND") 

'obtain the number of rows and columns in the created matrix 

nstatus=Matrix.GetNumColumns(nCols) 

nstatus=Matrix.GetNumRows(nRows) 

'create a new excel datasheet 
ex App .Sheets. Add 
exApp.Sheets(l).Name = datafield 

'put in column headers 

nStatus=Matrix.GefValue(2,0,strVal) 'start of columns (unique ID field) 
exApp.cells(l, 1).formula = strVal 

nStatus=Matrix.GetValue(0,2,strVal) 'start ofrows (datafield name) 
exApp.cells(l, 2).formula = strVal 


exRowCount = 2 'counter for the number of rows in the excel sheet 
for i = 3 To nCols 

nStatus=Matrix.GefValue(2,i,strUniqueValue) 
exApp.cells(exRowCount, 1).formula = strUniqueValue 

for j = 3 to nRows 

nstatus=Matrix.GetValue(j,i,strVal) 

NameLength = InStr(strVal, "") 

if NameLength <> 0 then 'if string isn't null then add values to sheet 
nStatus=Matrix. Get V alue(j ,2, strV al) 
exApp.cells(exRowCount, 2).formula = strVal 
exApp.cells(exRowCount, 1). formula = strUniqueValue 
exRowCount = exRowCount + 1 
booleanincremented = true 
end if 
next 'row 

if booleanincremented = false then 
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exRowCount = exRowCount + 1 
end if 

next 'next column 
end function 
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APPENDIX L - ASSIGN ACCESSION NUMBER MACRO 


Macro: As sign AN 

Description: Assigns the date and year to the Title worksheet that has been 

exported from Tech OASIS. Works with INSPEC database data between 
Jan 1969 and Dec 2000. 

Future work: remove the need to copy the "AN" worksheet to the workbook that 
contains the "Title" worksheet. 


Sub assignANO 

nRowsAN = CountRows("AN", 1) 
nRowsTitle = CountRows("Title", 1) 

Sheets("Title").Select 
Sheets("Title'icells(l, 3) = "PubDate" 

Sheets("Title").Cells(l, 4) = "PubYear" 

For X = 2 To nRowsTitle 

compare = Sheets("Title").Cells(x, 1) 

For i = 2 To nRowsAN 

timeint = Sheets("AN").Cells(i, 1) 
startAN = Sheets("AN").Cells(i, 2) 
endAN = Sheets("AN").Cells(i, 3) 

If compare >= startAN And compare <= endAN Then 
Sheets("Title").Cells(x, 3) = timeint 
Sheets("Title").Cells(x, 4) = Year(timelnt) 

End If 
Next i 
Next X 

End Sub 

Function CountRows(ByVal sheetName As String, ByVal colNum As Integer) As Double 
On Error Resume Next 
Dim currCell As Range, rowNum As Double 

SheetsC" & sheetName).Select 

If IsNumeric(colNum) Then 
Else 

colNum = 1 
End If 

rowNum = 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 

Do While currCell.Value <> "" 
rowNum = rowNum + 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 

Loop 
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CountRows = rowNum - 1 


End Function 


The following table maps the accession numbers to the month and year. 


Timeint 

Jan-69 
Feb-69 
Mar-6 9 
Apr-69 
May-6 9 
Jun-69 
Jul-69 
Aug-69 
Sep-69 
Oet-69 
Nov-69 
Dee-69 
Jan-70 
Feb-70 
Mar-70 
Apr-70 
May-70 
Jun-70 
Jul-70 
Aug-70 
Sep-70 
Oet-70 
Nov-70 
Dee-70 
Jan-71 
Feb-71 
Mar-71 
Apr-71 
May-71 
Jun-71 
Jul-71 
Aug-71 
Sep-71 
Oct-71 
Nov-71 
Dec-71 
Jan-72 
Feb-72 
Mar-72 
Apr-72 
May-72 
Jun-72 


Start 

1 

6,502 

13,002 

19,502 

26,001 

32.501 
39,001 

45.501 
52,001 

58.501 
65,000 

71,500 
78,000 
88,453 
98,906 

109,358 

119,810 

130,262 

140,715 

151,167 

161,619 

172,071 

182,524 

192,976 

203,428 

213,962 

224,495 

235,028 

245,561 

256,094 

266,627 

277,160 

287,693 

298,226 

308,759 

319,292 

329,825 

340,859 

351,892 

362,924 

373,957 

384,990 


End 

6,501 

13,001 

19,501 

26,000 

32.500 
39,000 

45.500 
52,000 

58.500 

64.999 
71,499 

77.999 
88,452 
98,905 

109,357 

119,809 

130,261 

140,714 

151,166 

161,618 

172,070 

182,523 

192,975 

203,427 

213,961 

224,494 

235,027 

245,560 

256,093 

266,626 

277,159 

287,692 

298,225 

308,758 

319,291 

329,824 

340,858 

351,891 

362,923 

373,956 

384,989 

396,022 
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Jul-72 

Aug-72 

Sep-72 

Oct-72 

Nov-72 

Dec-72 

Jan-73 

Feb-73 

Mar-73 

Apr-73 

May-73 

Jun-73 

Jul-73 

Aug-73 

Sep-73 

Oct-73 

Nov-73 

Dec-73 

Jan-74 

Feb-74 

Mar-74 

Apr-74 

May-74 

Jun-74 

Jul-74 

Aug-74 

Sep-74 

Oct-74 

Nov-74 

Dec-74 

Jan-75 

Feb-75 

Mar-75 

Apr-75 

May-75 

Jun-75 

Jul-75 

Aug-75 

Sep-75 

Oct-75 

Nov-75 

Dec-75 

Jan-76 

Feb-76 

Mar-76 

Apr-76 

May-76 

Jun-76 

Jul-76 

Aug-76 


396,023 

407,055 

418,088 

429,121 

440,154 

451,186 

462,219 

472,163 

482,107 

492,050 

501,994 

511.937 

521.881 
531,824 
541,767 
551,711 
561,654 
571,598 

581.541 
591,740 

601.938 
612,136 
622,333 
632,531 
642,729 
652,927 
663,125 
673,323 
683,520 
693,718 
703,916 
715,246 
726,576 
737,905 
749,235 
760,564 
771,894 
783,223 
794,552 

805.882 
817,211 

828.541 
839,870 
852,461 
865,051 
877,641 
890,231 
902,821 
915,411 
928,000 


407,054 

418,087 

429,120 

440,153 

451,185 

462,218 

472,162 

482,106 

492,049 

501,993 

511.936 

521.880 
531,823 
541,766 
551,710 
561,653 
571,597 

581.540 
591,739 

601.937 
612,135 
622,332 
632,530 
642,728 
652,926 
663,124 
673,322 
683,519 
693,717 
703,915 
715,245 
726,575 
737,904 
749,234 
760,563 
771,893 
783,222 
794,551 

805.881 
817,210 

828.540 
839,869 
852,460 
865,050 
877,640 
890,230 
902,820 
915,410 
927,999 
940,589 
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Sep-76 

940,590 

953,179 

Oct-76 

953,180 

965,769 

Nov-76 

965,770 

978,359 

Dec-76 

978,360 

990,949 

Jan-77 

990,950 

1,002,215 

Feb-77 

1,002,216 

1,013,481 

Mar-77 

1,013,482 

1,024,746 

Apr-77 

1,024,747 

1,036,011 

May-77 

1,036,012 

1,047,276 

Jun-77 

1,047,277 

1,058,542 

Jul-77 

1,058,543 

1,069,807 

Aug-77 

1,069,808 

1,081,072 

Sep-77 

1,081,073 

1,092,337 

Oct-77 

1,092,338 

1,103,603 

Nov-77 

1,103,604 

1,114,868 

Dec-77 

1,114,869 

1,126,133 

Jan-78 

1,126,134 

1,138,645 

Feb-78 

1,138,646 

1,151,156 

Mar-78 

1,151,157 

1,163,668 

Apr-78 

1,163,669 

1,176,179 

May-78 

1,176,180 

1,188,690 

Jun-78 

1,188,691 

1,201,201 

Jul-78 

1,201,202 

1,213,712 

Aug-78 

1,213,713 

1,226,223 

Sep-78 

1,226,224 

1,238,735 

Oct-78 

1,238,736 

1,251,246 

Nov-78 

1,251,247 

1,263,757 

Dec-78 

1,263,758 

1,276,268 

Jan-79 

1,276,269 

1,289,288 

Feb-79 

1,289,289 

1,302,306 

Mar-79 

1,302,307 

1,315,325 

Apr-79 

1,315,326 

1,328,344 

May-79 

1,328,345 

1,341,362 

Jun-79 

1,341,363 

1,354,381 

Jul-79 

1,354,382 

1,367,400 

Aug-79 

1,367,401 

1,380,418 

Sep-79 

1,380,419 

1,393,437 

Oct-79 

1,393,438 

1,406,456 

Nov-79 

1,406,457 

1,419,474 

Dec-79 

1,419,475 

1,432,493 

Jan-80 

1,432,494 

1,446,857 

Feb-80 

1,446,858 

1,461,219 

Mar-80 

1,461,220 

1,475,582 

Apr-80 

1,475,583 

1,489,944 

May-80 

1,489,945 

1,504,307 

Jun-80 

1,504,308 

1,518,670 

Jul-80 

1,518,671 

1,533,032 

Aug-80 

1,533,033 

1,547,395 

Sep-80 

1,547,396 

1,561,757 

Oct-80 

1,561,758 

1,576,120 
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Nov-80 

1,576,121 

1,590,482 

Dec-80 

1,590,483 

1,604,845 

Jan-81 

1,604,846 

1,618,814 

Feb-81 

1,618,815 

1,632,782 

Mar-81 

1,632,783 

1,646,751 

Apr-81 

1,646,752 

1,660,719 

May-81 

1,660,720 

1,674,687 

Jun-81 

1,674,688 

1,688,655 

Jul-81 

1,688,656 

1,702,623 

Aug-81 

1,702,624 

1,716,591 

Sep-81 

1,716,592 

1,730,560 

Oct-81 

1,730,561 

1,744,528 

Nov-81 

1,744,529 

1,758,496 

Dec-81 

1,758,497 

1,772,464 

Jan-82 

1,772,465 

1,788,053 

Feb-82 

1,788,054 

1,803,641 

Mar-82 

1,803,642 

1,819,228 

Apr-82 

1,819,229 

1,834,816 

May-82 

1,834,817 

1,850,404 

Jun-82 

1,850,405 

1,865,992 

Jul-82 

1,865,993 

1,881,579 

Aug-82 

1,881,580 

1,897,167 

Sep-82 

1,897,168 

1,912,755 

Oct-82 

1,912,756 

1,928,343 

Nov-82 

1,928,344 

1,943,930 

Dec-82 

1,943,931 

1,959,518 

Jan-83 

1,959,519 

1,975,701 

Feb-83 

1,975,702 

1,991,884 

Mar-83 

1,991,885 

2,008,066 

Apr-83 

2,008,067 

2,024,249 

May-83 

2,024,250 

2,040,431 

Jun-83 

2,040,432 

2,056,614 

Jul-83 

2,056,615 

2,072,796 

Aug-83 

2,072,797 

2,088,978 

Sep-83 

2,088,979 

2,105,161 

Oct-83 

2,105,162 

2,121,343 

Nov-83 

2,121,344 

2,137,526 

Dec-83 

2,137,527 

2,153,708 

Jan-84 

2,153,709 

2,169,988 

Feb-84 

2,169,989 

2,186,268 

Mar-84 

2,186,269 

2,202,547 

Apr-84 

2,202,548 

2,218,827 

May-84 

2,218,828 

2,235,106 

Jun-84 

2,235,107 

2,251,386 

Jul-84 

2,251,387 

2,267,665 

Aug-84 

2,267,666 

2,283,944 

Sep-84 

2,283,945 

2,300,224 

Oct-84 

2,300,225 

2,316,503 

Nov-84 

2,316,504 

2,332,783 

Dec-84 

2,332,784 

2,349,062 
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Jan-85 

2,349,063 

2,366,362 

Feb-85 

2,366,363 

2,383,660 

Mar-85 

2,383,661 

2,400,959 

Apr-85 

2,400,960 

2,418,258 

May-85 

2,418,259 

2,435,556 

Jun-85 

2,435,557 

2,452,855 

Jul-85 

2,452,856 

2,470,154 

Aug-85 

2,470,155 

2,487,452 

Sep-85 

2,487,453 

2,504,751 

Oct-85 

2,504,752 

2,522,050 

Nov-85 

2,522,051 

2,539,348 

Dec-85 

2,539,349 

2,556,647 

Jan-86 

2,556,648 

2,574,834 

Feb-86 

2,574,835 

2,593,019 

Mar-86 

2,593,020 

2,611,205 

Apr-86 

2,611,206 

2,629,391 

May-86 

2,629,392 

2,647,576 

Jun-86 

2,647,577 

2,665,762 

Jul-86 

2,665,763 

2,683,948 

Aug-86 

2,683,949 

2,702,133 

Sep-86 

2,702,134 

2,720,319 

Oct-86 

2,720,320 

2,738,505 

Nov-86 

2,738,506 

2,756,690 

Dec-86 

2,756,691 

2,774,876 

Jan-87 

2,774,877 

2,795,403 

Feb-87 

2,795,404 

2,815,929 

Mar-87 

2,815,930 

2,836,455 

Apr-87 

2,836,456 

2,856,981 

May-87 

2,856,982 

2,877,507 

Jun-87 

2,877,508 

2,898,033 

Jul-87 

2,898,034 

2,918,559 

Aug-87 

2,918,560 

2,939,085 

Sep-87 

2,939,086 

2,959,611 

Oct-87 

2,959,612 

2,980,137 

Nov-87 

2,980,138 

3,000,663 

Dec-87 

3,000,664 

3,021,189 

Jan-88 

3,021,190 

3,041,067 

Feb-88 

3,041,068 

3,060,944 

Mar-88 

3,060,945 

3,080,820 

Apr-88 

3,080,821 

3,100,697 

May-88 

3,100,698 

3,120,574 

Jun-88 

3,120,575 

3,140,451 

Jul-88 

3,140,452 

3,160,327 

Aug-88 

3,160,328 

3,180,204 

Sep-88 

3,180,205 

3,200,081 

Oct-88 

3,200,082 

3,219,958 

Nov-88 

3,219,959 

3,239,834 

Dec-88 

3,239,835 

3,259,711 

Jan-89 

3,259,712 

3,280,286 

Feb-89 

3,280,287 

3,300,859 
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Mar-89 

3,300,860 

3,321,433 

Apr-89 

3,321,434 

3,342,006 

May-89 

3,342,007 

3,362,580 

Jun-89 

3,362,581 

3,383,153 

Jul-89 

3,383,154 

3,403,727 

Aug-89 

3,403,728 

3,424,300 

Sep-89 

3,424,301 

3,444,874 

Oct-89 

3,444,875 

3,465,447 

Nov-89 

3,465,448 

3,486,021 

Dec-89 

3,486,022 

3,506,594 

Jan-90 

3,506,595 

3,528,124 

Feb-90 

3,528,125 

3,549,653 

Mar-90 

3,549,654 

3,571,182 

Apr-90 

3,571,183 

3,592,711 

May-90 

3,592,712 

3,614,240 

Jun-90 

3,614,241 

3,635,769 

Jul-90 

3,635,770 

3,657,297 

Aug-90 

3,657,298 

3,678,826 

Sep-90 

3,678,827 

3,700,355 

Oct-90 

3,700,356 

3,721,884 

Nov-90 

3,721,885 

3,743,413 

Dec-90 

3,743,414 

3,764,942 

Jan-91 

3,764,943 

3,786,594 

Feb-91 

3,786,595 

3,808,245 

Mar-91 

3,808,246 

3,829,896 

Apr-91 

3,829,897 

3,851,547 

May-91 

3,851,548 

3,873,198 

Jun-91 

3,873,199 

3,894,849 

Jul-91 

3,894,850 

3,916,499 

Aug-91 

3,916,500 

3,938,150 

Sep-91 

3,938,151 

3,959,801 

Oct-91 

3,959,802 

3,981,452 

Nov-91 

3,981,453 

4,003,103 

Dec-91 

4,003,104 

4,024,754 

Jan-92 

4,024,755 

4,049,002 

Feb-92 

4,049,003 

4,073,248 

Mar-92 

4,073,249 

4,097,495 

Apr-92 

4,097,496 

4,121,741 

May-92 

4,121,742 

4,145,988 

Jun-92 

4,145,989 

4,170,235 

Jul-92 

4,170,236 

4,194,481 

Aug-92 

4,194,482 

4,218,728 

Sep-92 

4,218,729 

4,242,974 

Oct-92 

4,242,975 

4,267,221 

Nov-92 

4,267,222 

4,291,467 

Dec-92 

4,291,468 

4,315,714 

Jan-93 

4,315,715 

4,336,048 

Feb-93 

4,336,049 

4,356,381 

Mar-93 

4,356,382 

4,376,714 

Apr-93 

4,376,715 

4,397,047 
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May-93 

4,397,048 

4,417,380 

Jun-93 

4,417,381 

4,437,713 

Jul-93 

4,437,714 

4,458,046 

Aug-93 

4,458,047 

4,478,379 

Sep-93 

4,478,380 

4,498,712 

Oct-93 

4,498,713 

4,519,045 

Nov-93 

4,519,046 

4,539,378 

Dec-93 

4,539,379 

4,559,711 

Jan-94 

4,559,712 

4,583,124 

Feb-94 

4,583,125 

4,606,536 

Mar-94 

4,606,537 

4,629,947 

Apr-94 

4,629,948 

4,653,359 

May-94 

4,653,360 

4,676,771 

Jun-94 

4,676,772 

4,700,183 

Jul-94 

4,700,184 

4,723,594 

Aug-94 

4,723,595 

4,747,006 

Sep-94 

4,747,007 

4,770,418 

Oct-94 

4,770,419 

4,793,830 

Nov-94 

4,793,831 

4,817,241 

Dec-94 

4,817,242 

4,840,653 

Jan-95 

4,840,654 

4,866,069 

Feb-95 

4,866,070 

4,891,484 

Mar-95 

4,891,485 

4,916,900 

Apr-95 

4,916,901 

4,942,315 

May-95 

4,942,316 

4,967,730 

Jun-95 

4,967,731 

4,993,145 

Jul-95 

4,993,146 

5,018,560 

Aug-95 

5,018,561 

5,043,975 

Sep-95 

5,043,976 

5,069,391 

Oct-95 

5,069,392 

5,094,806 

Nov-95 

5,094,807 

5,120,221 

Dec-95 

5,120,222 

5,145,636 

Jan-96 

5,145,637 

5,171,599 

Feb-96 

5,171,600 

5,197,561 

Mar-96 

5,197,562 

5,223,522 

Apr-96 

5,223,523 

5,249,484 

May-96 

5,249,485 

5,275,446 

Jun-96 

5,275,447 

5,301,408 

Jul-96 

5,301,409 

5,327,369 

Aug-96 

5,327,370 

5,353,331 

Sep-96 

5,353,332 

5,379,293 

Oct-96 

5,379,294 

5,405,255 

Nov-96 

5,405,256 

5,431,216 

Dec-96 

5,431,217 

5,457,178 

Jan-97 

5,457,179 

5,484,726 

Feb-97 

5,484,727 

5,512,273 

Mar-97 

5,512,274 

5,539,819 

Apr-97 

5,539,820 

5,567,366 

May-97 

5,567,367 

5,594,913 

Jun-97 

5,594,914 

5,622,460 


162 



Jul-97 

5,622,461 

5,650,006 

Aug-97 

5,650,007 

5,677,553 

Sep-97 

5,677,554 

5,705,100 

Oct-97 

5,705,101 

5,732,647 

Nov-97 

5,732,648 

5,760,193 

Dec-97 

5,760,194 

5,787,740 

Jan-98 

5,787,741 

5,816,212 

Feb-98 

5,816,213 

5,844,684 

Mar-98 

5,844,685 

5,873,155 

Apr-98 

5,873,156 

5,901,627 

May-98 

5,901,628 

5,930,098 

Jun-98 

5,930,099 

5,958,570 

Jul-98 

5,958,571 

5,987,041 

Aug-98 

5,987,042 

6,015,512 

Sep-98 

6,015,513 

6,043,984 

Oct-98 

6,043,985 

6,072,455 

Nov-98 

6,072,456 

6,100,927 

Dec-98 

6,100,928 

6,129,398 

Jan-99 

6,129,399 

6,158,933 

Feb-99 

6,158,934 

6,188,467 

Mar-99 

6,188,468 

6,218,001 

Apr-99 

6,218,002 

6,247,535 

May-99 

6,247,536 

6,277,069 

Jun-99 

6,277,070 

6,306,603 

Jul-99 

6,306,604 

6,336,137 

Aug-99 

6,336,138 

6,365,671 

Sep-99 

6,365,672 

6,395,205 

Oct-99 

6,395,206 

6,424,739 

Nov-99 

6,424,740 

6,454,273 

Dec-99 

6,454,274 

6,483,807 

Jan-00 

6,483,808 

6,509,866 

Feb-00 

6,509,867 

6,535,924 

Mar-00 

6,535,925 

6,561,981 

Apr-00 

6,561,982 

6,588,039 

May-00 

6,588,040 

6,614,097 

Jun-00 

6,614,098 

6,640,155 

Jul-00 

6,640,156 

6,666,212 

Aug-00 

6,666,213 

6,692,270 

Sep-00 

6,692,271 

6,718,328 

Oct-00 

6,718,329 

6,744,386 

Nov-00 

6,744,387 

6,770,443 

Dec-00 

6,770,444 

6,796,501 
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APPENDIX M - TERM COMBINATION MACRO CODE 


' Macro: sortTrigger 

' Description: This macro uses the descriptors sheet as an input and calls 
' subroutines to combine the n-tuple terms where 
' 1-tuple is the sheet of single descriptors 

' 2-tuple is the sheet of doubles 
' 3-tuple is the sheet of triples 

' etc. 

' this macro handles up to 32 tuple 

Sub sortTrigger() 

descriptorSheet = "n-Terms-01" 

'descriptorSheet = "Descriptors (Cleaned)" 

'descriptorSheet = "Top4Records" 

'descriptorSheet = "Top32Records" 

'Accession number range — First year —> Last time step 
ANStart = 0 
AN_End= 116796501 

Call doublesort(descriptorSheet, AN Start, AN End) 

Call triplesort(descriptorSheet, AN Start, AN End) 

Call quadsort(descriptorSheet, AN Start, AN End) 

Call fivesort(descriptorSheet, AN Start, AN End) 

Call sixsort(descriptorSheet, AN Start, AN End) 

Call sevensort(descriptorSheet, AN Start, AN End) 

Call eightsort(descriptorSheet, AN Start, AN End) 

Call ninesort(descriptorSheet, AN Start, AN End) 

Call tensort(descriptorSheet, AN Start, AN End) 

Call elevensort(descriptorSheet, AN Start, AN End) 

Call twelvesort(descriptorSheet, AN Start, AN End) 

Call thirteensort(descriptorSheet, AN Start, AN End) 

Call fourteensort(descriptorSheet, AN Start, AN End) 

Call fifteensort(descriptorSheet, AN Start, AN End) 

Call sixteensort(descriptorSheet, AN Start, AN End) 

Call seventeensort(descriptorSheet, AN Start, AN End) 

Call eighteensort(descriptorSheet, AN Start, AN End) 

Call Nineteensort(descriptorSheet, AN Start, AN End) 

Call twentysort(descriptorSheet, AN Start, AN End) 

Call twentylsort(descriptorSheet, AN Start, AN End) 

Call twenty2sort(descriptorSheet, AN Start, AN End) 

Call twenty3sort(descriptorSheet, AN Start, AN End) 

Call twenty4sort(descriptorSheet, AN Start, AN End) 

Call twenty5sort(descriptorSheet, AN Start, AN End) 

Call twenty6sort(descriptorSheet, AN Start, AN End) 

Call twenty7sort(deseriptorSheet, AN Start, AN End) 

Call twenty8sort(deseriptorSheet, AN Start, AN End) 

Call twenty9sort(deseriptorSheet, AN Start, AN End) 
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Call thirtysort(descriptorSheet, AN Start, AN End) 

Call thirty lsort(descriptorSheet, AN Start, AN End) 

Call thirty2sort(descriptorSheet, AN Start, AN End) 

End Sub 

Sub sortTriggerMore25() 

descriptorSheet = "Descriptors (Cleaned)" 

Call twenty6sort(descriptorSheet, AN Start, AN End) 

Call twenty7sort(descriptorSheet, AN Start, AN End) 

Call twenty8sort(descriptorSheet, AN Start, AN End) 

Call twenty9sort(descriptorSheet, AN Start, AN End) 

Call thirtysort(descriptorSheet, AN Start, AN End) 

Call thirty lsort(descriptorSheet, AN Start, AN End) 

Call thirty2sort(descriptorSheet, AN Start, AN End) 

End Sub 

Sub doublesort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
doubles = "n-Terms-02" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = doubles 

nRowsAN = CountRows(accession, 1) 'Find number of rows 
k = 1 'row counter for new double sheet 


For i = 2 To nRowsAN 
j =i+ 1 

accessiouNum = Sheets(accession).Cells(i, 1) 

' If (accessiouNum >= AN Start And accessiouNum <= AN End) Then 
Do Until Not (accessiouNum = Sheets(accession).Cells(j, 1)) 

Sheets(doubles).Cells(k, 1) = accessiouNum 

Sheets(doubles).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) 
k = k+ 1 

j=j + l 

Ifk = 65535 Then 

doubles = "n-Terms-02_2" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = doubles 
k= 1 
End If 
Loop 
' End If 
Next i 

End Sub 

Sub triplesort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 
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accession = descriptorSheet 
triple = "n-Terms-03" 


Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = triple 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for triple sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(l, 1)) 

Sheets(triple).Cells(k, 1) = accessionNum 

Sheets(triple).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accession).Cells(j, 2) & " 
" & Sheets(accession).Cells(l, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 

Ifk = 65500 Then 
triple = "n-Terms-03_2" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = triple 
k= 1 
End If 

Loop 
'End If 
Next i 

End Sub 

Sub quadsort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
quad = "n-Terms-04" 


Sheets. Add after~Sheets(Sheets.Count) 

Sheets( Sheets.Count). S elect 
ActiveSheet.Name = quad 

uRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = 1 + 1 

accessionNum = Sheets(accession).Cells(i, 1) 
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'If (accessionNum >= AN Start And accessionNum <= AN End) Then 

Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1)) 

Sheets(quad).Cells(k, 1) = accessionNum 

Sheets(quad).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & " 
" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
Ifk = 65500 Then 

quad = "n-Terms-04_2" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = quad 
k= 1 
End If 
Loop 
'End If 
Next i 
End Sub 

Sub fivesort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
pent = "n-Terms-05" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = pent 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 

Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1)) 

Sheets(pent).Cells(k, 1) = accessionNum 

Sheets(pent).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accession).Cells(j, 2) & " 
" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 
Sheets(accession).Cells(n, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 

Ifk = 65500 Then 
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pent = "n-Terms-05_2" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = pent 
k= 1 
End If 

Loop 
'End If 
Next i 

End Sub 

Sub sixsort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
six = "n-Terms-06" 


Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = six 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1)) 

Sheets(six).Cells(k, 1) = accessionNum 

Sheets(six).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & ";" 
& Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & ";" & Sheets(accession).Cells(n, 
2) & ";" & Sheets(accession).Cells(o, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0 = 0+1 

Ifk = 65500 Then 
six = "n-Terms-06_2" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = six 
k= 1 
End If 
Loop 
'End If 
Next i 
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End Sub 

Sub sevensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

aceession = deseriptorSheet 
seven = "n-Terms-07" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = seven 

uRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 

accessiouNum = Sheets(accession).Cells(i, 1) 

'If (accessiouNum >= AN Start And accessiouNum <= AN End) Then 
Do Until Not (accessiouNum = Sheets(accession).Cells(j, 1) And accessiouNum = 
Sheets(accession).Cells(l, 1) And accessiouNum = Sheets(accession).Cells(m, 1) And accessiouNum = 
Sheets(accession).Cells(n, 1) And accessiouNum = Sheets(accession).Cells(o, 1) And accessiouNum = 
Sheets(accession).Cells(p, 1)) 

Sheets(seven).Cells(k, 1) = accessiouNum 

Sheets(seven).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accession).Cells(j, 2) & " ; 
" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 

Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0 = 0+1 
p=p+l 

Loop 
'End If 
Next i 

End Sub 

Sub eightsort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
eight = "n-Terms-08" 


Sheets. Add after:=Sheets(Sheets.Count) 
Sheets(Sheets.Count).Select 
ActiveSheet.Name = eight 

uRowsAN = CountRows(accession, 1) 'Find number of rows 


k = 1 'row counter for new quad combo sheet 
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For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + 1 

accessionNum = Sheets(accessioii).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1)) 

Sheets(eight).Cells(k, 1) = accessionNum 

Sheets(eight).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & "; 
" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 

Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0 = 0+1 
p=p+l 
q = q+l 
Loop 
'End If 
Next i 

End Sub 

Sub ninesort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
nine = "n-Terms-09" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = nine 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q + 1 

accessionNum = Sheets(accession).Cells(i, 1) 
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'If (accessionNum >= AN Start And accessionNum <= AN End) Then 

Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1)) 

Sheets(nine).Cells(k, 1) = accessionNum 

Sheets(nine).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accession).Cells(j, 2) & " ; 

" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 

Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0 = 0+1 
p=p+l 
q = q+l 
r = r+ 1 
Loop 
'End If 
Next i 

End Sub 

Sub tensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
ten = "n-Terms-10" 


Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = ten 

uRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 

Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
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Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1)) 

Sheets(ten).Cells(k, 1) = accessionNum 

Sheets(ten).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & "; " 
& Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & ";" & Sheets(accession).Cells(n, 
2) & ";" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) & ";" & 
Sheets(accession).Cells(q, 2) & " ;" & Sheets(accession).Cells(r, 2) & ";" & Sheets(accession).Cells(s, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0 = 0+1 
p=p+l 
q = q+l 
r = r+ 1 
s = s + 1 
Loop 
'End If 
Next i 

End Sub 

Sub elevensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
eleven = "n-Terms-11" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = eleven 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1)) 

Sheets(eleven).Cells(k, 1) = accessionNum 
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Sheets(eleven).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & " 
;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & ";" & 
Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accessioii).Cells(p, 2) 
& " ;" & Sheets(accessioii).Cells(q, 2) & ";" & Sheets(accessioii).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accessioii).Cells(t, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0 = 0+1 
p=p+l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
Loop 
'End If 
Next i 

End Sub 

Sub twelvesort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

aeeession = deseriptorSheet 
twelve = "n-Terms-12" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count).Seleet 
AetiveSheet.Name = twelve 

uRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u = t + 1 

aeeessionNum = Sheets(aeeession).Cells(i, 1) 

'If (aeeessionNum >= AN Start And aeeessionNum <= AN End) Then 
Do Until Not (aeeessionNum = Sheets(aeeession).Cells(j, 1) And aeeessionNum = 
Sheets(aeeession).Cells(l, 1) And aeeessionNum = Sheets(aeeession).Cells(m, 1) And aeeessionNum = 
Sheets(aeeession).Cells(n, 1) And aeeessionNum = Sheets(aeeession).Cells(o, 1) And aeeessionNum = 
Sheets(aeeession).Cells(p, 1) And aeeessionNum = Sheets(aeeession).Cells(q, 1) And aeeessionNum = 
Sheets(aeeession).Cells(r, 1) And aeeessionNum = Sheets(aeeession).Cells(s, 1) And aeeessionNum = 
Sheets(aeeession).Cells(t, 1) And aeeessionNum = Sheets(aeeession).Cells(u, 1)) 

Sheets(twelve).Cells(k, 1) = aeeessionNum 
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Sheets(twelve).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accessioii).Cells(j, 2) & " ; 
" & Sheets(accessioii).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 

Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accessioii).Cells(p, 2) 
& " ;" & Sheets(accessioii).Cells(q, 2) & ";" & Sheets(accessioii).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accessioii).Cells(t, 2) & " ;" & Sheets(accessioii).Cells(u, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
Loop 
'End If 
Next i 

End Sub 

Sub thirteensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aeeession = deseriptorSheet 
thirteen = "n-Terms-13" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count).Seleet 
AetiveSheet.Name = thirteen 

nRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u = t+ 1 
V =u + 1 

aeeessiouNum = Sheets(aeeession).Cells(i, 1) 

'If (aeeessiouNum >= AN Start And aeeessiouNum <= AN End) Then 
Do Until Not (aeeessiouNum = Sheets(aeeession).Cells(j, 1) And aeeessiouNum = 
Sheets(aeeession).Cells(l, 1) And aeeessiouNum = Sheets(aeeession).Cells(m, 1) And aeeessiouNum = 
Sheets(aeeession).Cells(n, 1) And aeeessiouNum = Sheets(aeeession).Cells(o, 1) And aeeessiouNum = 
Sheets(aeeession).Cells(p, 1) And aeeessiouNum = Sheets(aeeession).Cells(q, 1) And aeeessiouNum = 
Sheets(aeeession).Cells(r, 1) And aeeessiouNum = Sheets(aeeession).Cells(s, 1) And aeeessiouNum = 
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Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1)) 

Sheets(thirteen).Cells(k, 1) = accessionNum 

Sheets(thirteen).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accession).Cells(j, 2) & " ; 
" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 

Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
Loop 
'End If 
Next i 

End Sub 

Sub fourteensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

accession = descriptorSheet 
fourteen = "n-Terms-14" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count).Select 
ActiveSheet.Name = fourteen 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 

accessionNum = Sheets(accession).Cells(i, 1) 
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'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1)) 

Sheets(fourteen).Cells(k, 1) = accessionNum 

Sheets(fourteen).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & " 
;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & ";" & 
Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
Loop 
'End If 
Next i 

End Sub 

Sub fifteensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
fifteen = "n-Terms-15" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets( Sheets.Count). S elect 
ActiveSheet.Name = fifteen 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
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s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 

accessionNum = Sheets(accessioii).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1) And accessionNum = 
Sheets(accession).Cells(x, 1)) 

Sheets(fifleen).Cells(k, 1) = accessionNum 

Sheets(fifleen).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & "; 

" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 

Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) & " ;" & 
Sheets(accession).Cells(x, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
Loop 
'End If 
Next i 

End Sub 

Sub sixteensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
sixteen = "n-Terms-16" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = sixteen 

uRowsAN = CountRows(accession, 1) 'Find number of rows 
k = 1 'row counter for new quad combo sheet 
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For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1) And accessionNum = 
Sheets(accession).Cells(x, 1) And accessionNum = Sheets(accession).Cells(y, 1)) 

Sheets(sixteen).Cells(k, 1) = accessionNum 

Sheets(sixteen).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accession).Cells(j, 2) & " ; 
" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 

Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) & " ;" & 
Sheets(accession).Cells(x, 2) & " ;" & Sheets(accession).Cells(y, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 

Loop 
'End If 
Next i 

End Sub 
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Sub seventeensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

accession = descriptorSheet 
seventeen = "n-Terms-17" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count).Select 
ActiveSheet.Name = seventeen 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1) And accessionNum = 
Sheets(accession).Cells(x, 1) And accessionNum = Sheets(accession).Cells(y, 1) And accessionNum = 
Sheets(accession).Cells(z, 1)) 

Sheets(seventeen).Cells(k, 1) = accessionNum 

Sheets(seventeen).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & 
";" & Sheets(accession).Cells(l, 2) & ";" & Sheets(accession).Cells(m, 2) & " ;" & 
Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) & " ;" & 
Sheets(accession).Cells(x, 2) & " ;" & Sheets(accession).Cells(y, 2) & ";" & Sheets(accession).Cells(z, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0 + 1 
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p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
Loop 
'End If 
Next i 

End Sub 

Sub eighteensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aeeession = deseriptorSheet 
eighteen = "n-Terms-18" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count). Seleet 
AetiveSheet.Name = eighteen 

nRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u = t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 

aeeessionNum = Sheets(aeeession).Cells(i, 1) 

'If (aeeessionNum >= AN Start And aeeessionNum <= AN End) Then 
Do Until Not (aeeessionNum = Sheets(aeeession).Cells(j, 1) And aeeessionNum = 
Sheets(aeeession).Cells(l, 1) And aeeessionNum = Sheets(aeeession).Cells(m, 1) And aeeessionNum = 
Sheets(aeeession).Cells(n, 1) And aeeessionNum = Sheets(aeeession).Cells(o, 1) And aeeessionNum = 
Sheets(aeeession).Cells(p, 1) And aeeessionNum = Sheets(aeeession).Cells(q, 1) And aeeessionNum = 
Sheets(aeeession).Cells(r, 1) And aeeessionNum = Sheets(aeeession).Cells(s, 1) And aeeessionNum = 
Sheets(aeeession).Cells(t, 1) And aeeessionNum = Sheets(aeeession).Cells(u, 1) And aeeessionNum = 
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Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1) And accessionNum = 
Sheets(accession).Cells(x, 1) And accessionNum = Sheets(accession).Cells(y, 1) And accessionNum = 
Sheets(accession).Cells(z, 1) And accessionNum = Sheets(accession).Cells(aa, 1)) 

Sheets(eighteen).Cells(k, 1) = accessionNum 

Sheets(eighteen).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accession).Cells(j, 2) & " 
;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & ";" & 
Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) & " ;" & 
Sheets(accession).Cells(x, 2) & " ;" & Sheets(accession).Cells(y, 2) & ";" & Sheets(accession).Cells(z, 2) 
& " ;" & Sheets(accession).Cells(aa, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
Loop 
'End If 
Next i 

End Sub 

Sub Nineteensort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

accession = descriptorSheet 
nineteen = "n-Terms-19" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets( Sheets.Count). S elect 
ActiveSheet.Name = nineteen 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
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q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 

accessionNum = Sheets(accessioii).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1) And accessionNum = 
Sheets(accession).Cells(x, 1) And accessionNum = Sheets(accession).Cells(y, 1) And accessionNum = 
Sheets(accession).Cells(z, 1) And accessionNum = Sheets(accession).Cells(aa, 1) And accessionNum = 
Sheets(accession).Cells(bb, 1)) 

Sheets(nineteen).Cells(k, 1) = accessionNum 

Sheets(nineteen).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & Sheets(accession).Cells(j, 2) & " 
;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & ";" & 
Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) & " ;" & 
Sheets(accession).Cells(x, 2) & " ;" & Sheets(accession).Cells(y, 2) & ";" & Sheets(accession).Cells(z, 2) 
& " ;" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + l 
Loop 
'End If 
Next i 
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End Sub 

Sub twentysort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

accession = descriptorSheet 
twenty = "n-Terms-20" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count).Select 
ActiveSheet.Name = twenty 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1) And accessionNum = 
Sheets(accession).Cells(x, 1) And accessionNum = Sheets(accession).Cells(y, 1) And accessionNum = 
Sheets(accession).Cells(z, 1) And accessionNum = Sheets(accession).Cells(aa, 1) And accessionNum = 
Sheets(accession).Cells(bb, 1) And accessionNum = Sheets(accession).Cells(cc, 1)) 

Sheets(twenty).Cells(k, 1) = accessionNum 

Sheets(twenty).Cells(k, 2) = Sheets(accession).Cells(i, 2) & " ;" & Sheets(accession).Cells(j, 2) & " ; 
" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & " ;" & 

Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) & " ;" & 
Sheets(accession).Cells(x, 2) & " ;" & Sheets(accession).Cells(y, 2) & ";" & Sheets(accession).Cells(z, 2) 
& " ;" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 2) & " ;" & 
Sheets(accession).Cells(cc, 2) 
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k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
ii = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
Loop 
'End If 
Next i 

End Sub 

Sub twenty lsort(ByVal deseriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aeeession = deseriptorSheet 
twenty 1 = "n-Terms-21" 

Sheets. Add after:=Sheets(Sheets.Count) 

Sheets(Sheets.Count).Seleet 
AetiveSheet.Name = twentyl 

nRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u = t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
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aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 

accessionNum = Sheets(accessioii).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(dd, 1)) 

'Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1) And accessionNum = 
Sheets(accession).Cells(x, 1) And accessionNum = Sheets(accession).Cells(y, 1) And accessionNum = 
Sheets(accession).Cells(z, 1) And accessionNum = Sheets(accession).Cells(aa, 1) And accessionNum = 
Sheets(accession).Cells(bb, 1) And accessionNum = Sheets(accession).Cells(cc, 1) And accessionNum = 
Sheets(accession).Cells(dd, 1)) 

Sheets(twentyl).Cells(k, 1) = accessionNum 

Sheets(twentyl).Cells(k, 2) = Sheets(accession).Cells(i, 2) & "; " & Sheets(accession).Cells(j, 2) & " 

; " & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & ";" & 
Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) & " ;" & 
Sheets(accession).Cells(x, 2) & " ;" & Sheets(accession).Cells(y, 2) & ";" & Sheets(accession).Cells(z, 2) 
& " ;" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 2) & " ;" & 
Sheets(accession).Cells(cc, 2) & ";" & Sheets(accession).Cells(dd, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + l 
cc = cc + 1 
dd = dd + 1 
Loop 
'End If 
Next i 

End Sub 
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Sub twenty2sort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aceession = descriptorSheet 
twenty2 = "n-Terms-22" 

Sheets. Add after~Sheets(Sheets.Count) 

Sheets(Sheets.Count).Select 
ActiveSheet.Name = twenty2 

uRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(ee, 1)) 

'Do Until Not (accessionNum = Sheets(accession).Cells(j, 1) And accessionNum = 
Sheets(accession).Cells(l, 1) And accessionNum = Sheets(accession).Cells(m, 1) And accessionNum = 
Sheets(accession).Cells(n, 1) And accessionNum = Sheets(accession).Cells(o, 1) And accessionNum = 
Sheets(accession).Cells(p, 1) And accessionNum = Sheets(accession).Cells(q, 1) And accessionNum = 
Sheets(accession).Cells(r, 1) And accessionNum = Sheets(accession).Cells(s, 1) And accessionNum = 
Sheets(accession).Cells(t, 1) And accessionNum = Sheets(accession).Cells(u, 1) And accessionNum = 
Sheets(accession).Cells(v, 1) And accessionNum = Sheets(accession).Cells(w, 1) And accessionNum = 
Sheets(accession).Cells(x, 1) And accessionNum = Sheets(accession).Cells(y, 1) And accessionNum = 
Sheets(accession).Cells(z, 1) And accessionNum = Sheets(accession).Cells(aa, 1) And accessionNum = 
Sheets(accession).Cells(bb, 1) And accessionNum = Sheets(accession).Cells(cc, 1) And accessionNum = 
Sheets(accession).Cells(dd, 1)) 

Sheets(twenty2).Cells(k, 1) = accessionNum 

Sheets(twenty2).Cells(k, 2) = Sheets(accession).Cells(i, 2) & "; " & Sheets(accession).Cells(j, 2) & " 

; " & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) & ";" & 
Sheets(accession).Cells(n, 2) & " ;" & Sheets(accession).Cells(o, 2) & ";" & Sheets(accession).Cells(p, 2) 
& " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) & " ;" & 
Sheets(accession).Cells(s, 2) & ";" & Sheets(accession).Cells(t, 2) & " ;" & Sheets(accession).Cells(u, 2) 
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& " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 2) & " ;" & 
Sheets(accession).Cells(x, 2) & " ;" & Sheets(accession).Cells(y, 2) & ";" & Sheets(accession).Cells(z, 2) 
& " ;" & Sheets(accessioii).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 2) & " ;" & 
Sbeets(accession).Cells(cc, 2) & " ;" & Sbeets(accession).Cells(dd, 2) & ";" & 

Sbeets(accession).Cells(ee, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
Loop 
'End If 
Next i 

End Sub 

Sub twenty3sort(ByVal descriptorSbeet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

accession = descriptorSbeet 
targetSbeetName = "n-Terms-23" 

Sheets. Add after:=Worksbeets(Worksheets.Count) 

Sheets( Sheets.Count). S elect 
ActiveSheet.Name = targetSbeetName 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
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t = s + 1 
u = t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 '22 
ff = ee + 1 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(ff, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessionNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 
2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
Loop 
'End If 
Next i 

End Sub '23 
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Sub twenty4sort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aceession = deseriptorSheet 
targetSheetName = "n-Terms-24" 

Sheets. Add after~Worksheets(Worksheets.Count) 

Sheets(Sheets.Count).Select 
ActiveSheet.Name = targetSheetName 

uRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 

accessiouNum = Sheets(accession).Cells(i, 1) 

'If (accessiouNum >= AN Start And accessiouNum <= AN End) Then 
Do Until Not (accessiouNum = Sheets(accession).Cells(gg, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessiouNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 

2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) & ";" & Sheets(accession).Cells(gg, 
2) 

k = k+ 1 

j=j + l 
1 = 1+1 
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m = m + 1 
n = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
gg = gg + 1 
Loop 
'End If 
Next i 

End Sub '24 

Sub twenty5sort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aeeession = deseriptorSheet 
targetSheetName = "n-Terms-25" 

Sheets. Add after~Worksheets(Worksheets.Count) 

Sheets(Sheets.Count).Seleet 
AetiveSheet.Name = targetSheetName 

nRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
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y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 
hh = gg + 1 '25 

accessionNum = Sheets(accessioii).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(hh, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessionNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 

2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) & ";" & Sheets(accession).Cells(gg, 
2)_ 

& ";" & Sheets(accession).Cells(hh, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
gg = gg + 1 
hh=hh+ 1 
Loop 
'End If 
Next i 

End Sub ' 25 
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Sub twenty6sort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aceession = deseriptorSheet 
targetSheetName = "n-Terms-26" 

Sheets. Add after~Worksheets(Worksheets.Count) 

Sheets(Sheets.Count).Select 
ActiveSheet.Name = targetSheetName 

uRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 
hh = gg + 1 '25 
ii = hh + 1 '26 

accessiouNum = Sheets(accession).Cells(i, 1) 

'If (accessiouNum >= AN Start And accessiouNum <= AN End) Then 
Do Until Not (accessiouNum = Sheets(accession).Cells(ii, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessiouNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 

2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) & ";" & Sheets(accession).Cells(gg, 
2)_ 

& ";" & Sheets(accession).Cells(hh, 2) & ";" & Sheets(accession).Cells(ii, 2) 
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k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
ii = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
gg = gg + 1 
hh = hh + 1 
ii = ii + 1 
Loop 
'End If 
Next i 

End Sub ' 26 

Sub twenty7sort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aeeession = deseriptorSheet 
targetSheetName = "n-Terms-27" 

Sheets. Add after:=Worksheets(Worksheets.Count) 

Sheets(Sheets.Count). Seleet 
AetiveSheet.Name = targetSheetName 

uRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
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u = t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 
hh = gg + 1 '25 
ii = hh + 1 '26 
jj=ii + l '27 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(jj, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessionNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 

2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) & ";" & Sheets(accession).Cells(gg, 
2)_ 

& ";" & Sheets(accession).Cells(hh, 2) & ";" & Sheets(accession).Cells(ii, 2) & " ;" & 
Sheets(accession).Cells(jj, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
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gg = gg +1 
hh = hh + 1 
ii = ii + 1 '26 
jj=ij + l'27 
Loop 
'End If 
Next i 

End Sub ' 27 

Sub twenty8sort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aeeession = deseriptorSheet 
targetSheetName = "n-Terms-28" 

Sheets. Add after~Worksheets(Worksheets.Count) 

Sheets(Sheets.Count). Seleet 
AetiveSheet.Name = targetSheetName 

nRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
ee = bb + 1 
dd = ee + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 
hh = gg + 1 '25 
ii = hh + 1 '26 
jj = ii + 1 '27 
kk=jj + 1 '28 

aeeessionNum = Sheets(aeeession).Cells(i, 1) 

'If (aeeessionNum >= AN Start And aeeessionNum <= AN End) Then 
Do Until Not (aeeessionNum = Sheets(aeeession).Cells(kk, 1)) 

Sheets(targetSheetName).Cells(k, 1) = aeeessionNum 
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Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accessioii).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accessioii).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accessioii).Cells(r, 2) 
& " ;" & Sheets(accessioii).Cells(s, 2) & " ;" & Sheets(accessioii).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accessioii).Cells(w, 

2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accessioii).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accessioii).Cells(bb, 
2) & ";" & Sbeets(accession).Cells(cc, 2) & " ;" & Sbeets(accessioii).Cells(dd, 2) & "; " & 
Sbeets(accession).Cells(ee, 2) & ";" & Sbeets(accession).Cells(ff, 2) & ";" & Sbeets(accessioii).Cells(gg, 
2)_ 

& ";" & Sbeets(accession).Cells(bb, 2) & ";" & Sbeets(accession).Cells(ii, 2) & " ;" & 
Sbeets(accession).Cells(jj, 2) & " ;" & Sbeets(accessioii).Cells(kk, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
gg = gg + 1 
bb = bb + 1 
ii = ii + 1 '26 
jj=ii + l'27 
kk = kk + 1 '28 
Loop 
'End If 
Next i 

End Sub ' 28 

Sub twenty9sort(ByVal descriptorSbeet As String, ByVal AN Start As Double, ByVal AN End As 
Double) 

aeeession = deseriptorSbeet 
targetSbeetName = "n-Terms-29" 

Sheets. Add after:=Worksbeets(Worksbeets.Count) 

Sbeets(Sbeets.Count).Seleet 
AetiveSbeet.Name = targetSbeetName 
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nRowsAN = CouiitRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u = t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 
hh = gg + 1 '25 
ii = hh + 1 '26 
jj = ii + 1 '27 
kk=jj + 1 '28 
11 = kk + 1 '29 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(ll, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessionNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 

2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) & ";" & Sheets(accession).Cells(gg, 
2)_ 

& ";" & Sheets(accession).Cells(hh, 2) & ";" & Sheets(accession).Cells(ii, 2) & " ;" & 
Sheets(accession).Cells(jj, 2) & " ;" & Sheets(accession).Cells(kk, 2) & ";" & Sheets(accession).Cells(ll, 
2) 

k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
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0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
gg = gg + 1 
hh = hh + 1 
ii = ii + 1 '26 
jj=jj + l'27 
kk = kk + 1 '28 
11 = 11 + 1 '29 
Loop 
'End If 
Next i 

End Sub ' 29 

Sub thirtysort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

aeeession = deseriptorSheet 
targetSheetName = "n-Terms-30" 

Sheets. Add after~Worksheets(Worksheets.Count) 

Sheets(Sheets.Count).Seleet 
AetiveSheet.Name = targetSheetName 

uRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u = t+ 1 
v=u+ 1 
W = V + 1 
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X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 
hh = gg + 1 '25 
ii = hh + 1 '26 
jj = ii + 1 '27 
kk=jj + 1 '28 
11 = kk + 1 '29 
mm = 11 + 1 '30 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(mm, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessionNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 

2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) & ";" & Sheets(accession).Cells(gg, 
2)_ 

& ";" & Sheets(accession).Cells(hh, 2) & ";" & Sheets(accession).Cells(ii, 2) & " ;" & 
Sheets(accession).Cells(jj, 2) & " ;" & Sheets(accession).Cells(kk, 2) & ";" & Sheets(accession).Cells(ll, 
2) & ";" & Sheets(accession).Cells(mm, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
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ff=ff+1 
gg = gg +1 
hh=hh+ 1 
ii = ii + 1 '26 
jj=ij + l'27 
kk = kk + 1 '28 
11 = 11 + 1 '29 
mm = mm + 1 '30 
Loop 
'End If 
Next i 

End Sub ' 30 

Sub thirty lsort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 

aeeession = deseriptorSheet 
targetSheetName = "n-Terms-31" 

Sheets. Add after~Worksheets(Worksheets.Count) 

Sheets(Sheets.Count). Seleet 
AetiveSheet.Name = targetSheetName 

nRowsAN = CountRows(aeeession, 1) 'Find number of rows 

k = 1 'row eounter for new quad eombo sheet 

For i = 2 To nRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o = n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u = t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
ee = bb + 1 
dd = ee + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 
hh = gg + 1 '25 
ii = hh + 1 '26 
jj = ii + 1 '27 
kk=jj + 1 '28 
11 = kk + 1 '29 
mm = 11 + 1 '30 
nn = mm + 1 '31 
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accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(nn, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessionNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 

2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) & ";" & Sheets(accession).Cells(gg, 
2)_ 

& ";" & Sheets(accession).Cells(hh, 2) & ";" & Sheets(accession).Cells(ii, 2) & " ;" & 
Sheets(accession).Cells(jj, 2) & " ;" & Sheets(accession).Cells(kk, 2) & ";" & Sheets(accession).Cells(ll, 
2) & ";" & Sheets(accession).Cells(mm, 2) & "; " & Sheets(accession).Cells(nn, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0+1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
gg = gg + 1 
hh = hh + 1 
ii = ii + 1 '26 
jj=jj + l'27 
kk = kk + 1 '28 
11 = 11 + 1 '29 
mm = mm + 1 '30 
nn = nn + 1 '31 
Loop 
'End If 
Next i 

End Sub'31 

Sub thirty2sort(ByVal descriptorSheet As String, ByVal AN Start As Double, ByVal AN End As Double) 
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accession = descriptorSheet 
targetSheetName = "n-Terms-32 


Sheets. Add after~Worksheets(Worksheets.Count) 

Sheets(Sheets.Count). Select 
ActiveSheet.Name = targetSheetName 

nRowsAN = CountRows(accession, 1) 'Find number of rows 

k = 1 'row counter for new quad combo sheet 

For i = 2 To uRowsAN 
j=i+l 
l=j + l 
m = l+ 1 
n = m + 1 
o =n + 1 
p = o + 1 
q = p + l 
r = q+ 1 
s = r + 1 
t = s + 1 
u=t+ 1 
v=u+ 1 
W = V + 1 
X = w + 1 
y = x+ 1 
z = y + 1 
aa = z + 1 
bb = aa + 1 
cc = bb + 1 
dd = cc + 1 
ee = dd + 1 '22 
ff = ee + 1 
gg = ff+l'24 
hh = gg + 1 '25 
ii = hh + 1 '26 
jj = ii + 1 '27 
kk=jj + 1 '28 
11 = kk + 1 '29 
mm = 11 + 1 '30 
nn = mm + 1 '31 
00 = nn + 1 '32 

accessionNum = Sheets(accession).Cells(i, 1) 

'If (accessionNum >= AN Start And accessionNum <= AN End) Then 
Do Until Not (accessionNum = Sheets(accession).Cells(oo, 1)) 

Sheets(targetSheetName).Cells(k, 1) = accessionNum 
Sheets(targetSheetName).Cells(k, 2) = Sheets(accession).Cells(i, 2) & ";" & 
Sheets(accession).Cells(j, 2) & " ;" & Sheets(accession).Cells(l, 2) & " ;" & Sheets(accession).Cells(m, 2) 
& " ;" & Sheets(accession).Cells(n, 2) & ";" & Sheets(accession).Cells(o, 2) & " ;" & 
Sheets(accession).Cells(p, 2) & " ;" & Sheets(accession).Cells(q, 2) & ";" & Sheets(accession).Cells(r, 2) 
& " ;" & Sheets(accession).Cells(s, 2) & " ;" & Sheets(accession).Cells(t, 2) & " ;" & 
Sheets(accession).Cells(u, 2) & " ;" & Sheets(accession).Cells(v, 2) & ";" & Sheets(accession).Cells(w, 
2) & ";" & Sheets(accession).Cells(x, 2) & ";" & Sheets(accession).Cells(y, 2) & ";" & 
Sheets(accession).Cells(z, 2) & ";" & Sheets(accession).Cells(aa, 2) & ";" & Sheets(accession).Cells(bb, 
2) & ";" & Sheets(accession).Cells(cc, 2) & " ;" & Sheets(accession).Cells(dd, 2) & "; " & 
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Sheets(accession).Cells(ee, 2) & ";" & Sheets(accession).Cells(ff, 2) & ";" & Sheets(accessioii).Cells(gg, 
2)_ 

& ";" & Sheets(accession).Cells(hh, 2) & ";" & Sheets(accessioii).Cells(ii, 2) & " ;" & 
Sheets(accession).Cells(jj, 2) & " ;" & Sheets(accessioii).Cells(kk, 2) & ";" & Sheets(accession).Cells(ll, 
2) & ";" & Sheets(accession).Cells(mm, 2) & " ;" & Sheets(accessioii).Cells(mi, 2) & ";" & 
Sheets(accession).Cells(oo, 2) 
k = k+ 1 

j=j + l 
1 = 1+1 
m = m + 1 
n = n+ 1 
0=0 + 1 
p=p + l 
q = q+l 
r = r+ 1 
s = s + 1 
t = t+ 1 
u=u+ 1 
v=v+ 1 
w = w + 1 
X =x + 1 
y = y+l 
z = z + 1 
aa = aa + 1 
bb=bb + 1 
cc = cc + 1 
dd = dd + 1 
ee = ee + 1 
ff=ff+ 1 
gg = gg + 1 
hh=hh+ 1 
ii = ii + 1 '26 
jj=jj + l'27 
kk = kk + 1 '28 
11 = 11 + 1 '29 
mm = mm + 1 '30 
nn = rni + 1 '31 
00 = 00 + 1 '32 
Loop 
'End If 
Next i 

End Sub ' 32 

' col - returns column letter 
' input column number 

Function col(ByVal columnNumber As Integer) As String 

Select Case columnNumber 
Case 1 
col = "A" 

Case 2 
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col = "B" 
Case 3 
col = "C" 
Case 4 
col = "D" 
Case 5 
col = "E" 
Case 6 
col = "F" 
Case 7 
col = "G" 
Case 8 
col = "H" 
Case 9 
col = "i" 
Case 10 
col = "J" 
Case 11 
col = "K" 
Case 12 
col = "L" 
Case 13 
col = "M" 
Case 14 
col = "N" 
Case 15 
col = "O" 
Case 16 
col = "P" 
Case 17 
col = "Q" 
Case 18 
col = "R" 
Case 19 
col = "S" 
Case 20 
col = "T" 
Case 21 
col = "U" 
Case 22 
col = "V" 
Case 23 
col = "W" 
Case 24 
col = "X" 
Case 25 
col = "Y" 
Case 26 
col = "Z" 
Case 27 
col = "AA" 
Case 28 
col = "AB" 
Case 29 
col = "AC" 
Case 30 
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col = "AD" 
Case 31 
col = "AE" 
Case 32 
col = "AF" 
Case 33 
col = "AG" 
Case 34 
col = "AH" 
Case 35 
col = "AI" 
Case 36 
col = "AJ" 
Case 37 
col = "AK" 
Case 38 
col = "AL" 
Case 39 
col = "AM" 
Case 40 
col = "AN" 
Case 41 
col = "AO" 
Case 42 
col = "AP" 
Case 43 
col = "AQ" 
Case 44 
col = "AR" 
Case 45 
col = "AS" 
Case 46 
col = "AT" 
Case 47 
col = "AU" 
Case 48 
col = "AV" 
Case 49 
col = "AW" 
Case 50 
col = "AX" 
Case 51 
col = "AY" 
Case 52 
col = "AZ" 
Case 53 
col = "BA" 
Case 54 
col = "BB" 
Case 55 
col = "BC" 
Case 56 
col = "BD" 
Case 57 
col = "BE" 
Case others 
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col = "Z" 

End Select 

End Function 

Function CountRows(ByVal sheetName As String, ByVal colNum As Integer) As Double 

On Error Resume Next 

Dim currCell As Range, rowNum As Double 

SheetsC" & sheetName).Select 

If IsNumeric(coINum) Then 
Else 

colNum = 1 
End If 
rowNum = I 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 

Do While currCell.Value <> "" 
rowNum = rowNum + 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 

Loop 

CountRows = rowNum - 1 
End Function 
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APPENDIX N - AFFILIATION DISTRIBUTION MARCO CODE 


MACRO: AffiliationDistribution 
Author: Matt Behnke 
Created: 11/5/01 

Deseription: 1) distributes affdiations into bands eorresponding to the mean and 
standard deviation of the number of publieations an affdiation produeed 

2) eomputes the entropy of the bands and the world 

3) ealeulations temperature and pressure of the dataset 


'GLOBAL VARIABLES 


Dim teehnologyName As String 
Dim stepinterval As String 
Dim eurrFilename As String 
Dim datasheet As String 
Dim deseriptorMatrixSheet As String 
Dim descriptorMatrixSheetY As String 
Dim worldEntropySheet As String 
Dim worldEntropySheetY As String 
Dim affiliationDescMatrix As String 


'name of the dataset (ada, java, ete) 

'the time between time steps (months, years) 

'the name of the spreadsheet file 
'sheet that eontains the matrix of affiliations 
'sheet that eontains the matrix of terms (X) 

'sheet that eontains the matrix of terms (opposite ofX) 
'sheet that eontains world entropy 
'sheet that eontains world entropy y (opposite of X) 
'sheet that assoeiates terms to affiliations 


Dim testBandA As Integer 
Dim testBandB As Integer 
Dim testBandC As Integer 
Dim testBandD As Integer 


'the test variables are used to determine if 
'there are reeords in a band 
'the variable stores the number of affilations 
'that are in eaeh band 


'CONSTANTS 

Private Const HYP3_FIT As Integer = 0 
Private Const EXP3_FIT As Integer = 1 
Private Const POW2_FIT As Integer = 2 

Sub a_Test_Run() 'eontrolled the subs that are ran.. 

testBandA = 1 
testBandB = 1 
testBandC = 1 
testBandD = 1 


'sheet name eonstants 

deseriptorMatrixSheet = "descriptordataX" 
deseriptorMatrixSheetY = "deseriptordataY" 
worldEntropySheet = "WorldCumulativeEntropyX" 
worldEntropySheetY = "WorldCumulativeEntropyY" 
affiliationDeseMatrix = "deseriptormatrixaffil" 

'subs 
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Call instancesSummary 'used to check if the world instances match up with the # of instances of 
' the sum of the bands (a + b + c + d) 


End Sub 


Sub: DistributeAffiliations 
Author: Matt Behnke 
Created: 11/5/01 

Description: The sub routine that calls all the sub routines 
inputs: 

Outputs: 


Sub DistributeAffiliationsO 

testBandA = 0 
testBandB = 0 
testBandC = 0 
testBandD = 0 

currFilename = Application.ActiveWorkbook.Name 
datasheet = ActiveSheet.Name 

'sheet name constants 

descriptorMatrixSheet = "descriptordataX" 
descriptorMatrixSheetY = "descriptordataY" 
worldEntropy Sheet = "WorldCumulativeEntropyX" 
worldEntropy Sheet Y = "WorldCumulativeEntropyY" 
affdiatiouDescMatrix = "descriptormatrixaffil" 

technologyName = InputBox("Enter the name of the technology.") 
stepinterval = InputBox("Enter the time between time steps") 


Call CopyMathCadObj 
'put the cumulative values on the sheets: 

Call CalcCumulative(dataSheet) 'datasheet has the num records each affilation produced over time 
Call CalcCumulative(descriptorMatrixSheet) 

Call CalcCumulative(affihationDescMatrix) 

Call CalcCumulative("Affiliation_authors") 

'determine the num of records in each band 
Call AffiliationDistribution 

'use the summary sheet created by Affiliation Distribution to graph the distributions of each band: 
Call CopyDistributionGraph 

'create descriptor data y sheet from descriptor data x sheet: 

Call CreateDescriptorDataY("descriptor data X", "descriptor data Y") 

'compute world entropy sheets x and y (input, output) 

Call ComputeEntropyC'descriptor data X", "World Cumulative Entropy X", 1) 
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Call ComputeEntropy("descriptor_data_Y", "World Cumulative Entropy Y", 2) 

'fills the band stats of the world: 

Call FillBandStatsC'World") 

'eompute nu and psi for the world: 

Call v_eale_v_psi_sheet("World") 

'-BANDS- 

'fill the band with the affiliations and their number of publieations that fit the number of 
'publieations range for that band determined by Affiliation Distribution: 

testBandA = FillBand("A_Band") 

'fill band stats: 

If testBandA > 0 Then 

Call FillBandStats("A_Band") 

'ereate the matrix of affiliation with author instanees 
Call FillBandAuthors("A_Band") 

'ealeulate nu and psi: 

Call v_eale_v_psi_sheet("A_Band") 

'determine the matrix of terms and the number of instanees for the band 
Call FillBandTerms("A_Band") 

'eompute the entropy of the terms in the band 
Call FillBandTermsEntropyC'A Band") 

'ereate a summaty of band., num of publieations, authors, terms, entropy: 

Call affiliationBandSummaryC'A Band") 

End If 


put in a test for eaeh band & store the results, then re-run the stuff 


testBandB = FillBand("B_Band") 

If testBandB > 0 Then 

Call FiIIBandStats("B_Band") 

Call FiIIBandAuthors("B_Band") 

Call v_eale_v_psi_sheet("B_Band") 

Call FiIIBandTerms("B_Band") 

Call FillBandTermsEntropyC'B Band") 
Call affiliationBandSummaryC'B Band") 
End If 

testBandC = FillBand("C_Band") 

If testBandC > 0 Then 
Call FiIIBandStats("C_Band") 

Call FiIIBandAuthors("C_Band") 

Call v_eaIe_v_psi_sheet("C_Band") 

Call FiIIBandTerms("C_Band") 

Call FillBandTermsEntropyC'C Band") 
Call affiliationBandSummaryC'C Band") 
End If 
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testBandD = FillBand("D_Band") 

If testBandD > 0 Then 

Call FillBandStats("D_Band") 

Call FillBandAuthors("D_Band") 

Call v_eale_v_psi_sheet("D_Band") 

Call FillBandTerms("D_Band") 

Call FillBandTermsEntropyC'D Band") 
Call affiliationBandSummaryC'D Band") 
End If 


Call instaneesSummary 'used to eheek if the world instanees mateh up with the # of instanees of 
' the sum of the bands (a + b + e + d) 

Call entropySummary 'for the world 
Call affdiationSummary 'for the world 
Call affdiationSummaryPart2 'eopies graphs and eomputes 
Call affdiationSummaryPartS 'temp and pressure... 

Call CopyABCDGraph 'eopy the abed learning curve graphs 
Call fillMonthsRowTrigger 

If testBandA > 0 Then 

Call CopyBandSummaryGraphs("A_Band") 'entropy summary graphs 
End If 

If testBandB > 0 Then 

Call CopyBandSummaryGraphs("B_Band") 

End If 

If testBandC > 0 Then 

Call CopyBandSummaryGraphs("C_Band") 

End If 

If testBandD > 0 Then 

Call CopyBandSummaryGraphs("D_Band") 

End If 

Call CopyBandSummaryGraphs("World") 

End Sub 


Sub: AffliationDistribution 
Author: Matt Behnke 
Created: 11/5/01 

Description: figures out the division of bands, and the number of affiliations per band 
inputs: 

Outputs: 


Sub AffiliationDistribution() 
uumRows = CountRows(dataSheet, 1) 
Sheets.Add 

sheetName = "Distribution" 
ActiveSheet.Name = sheetName 
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Sheets(sheetName).Cells(l, 1) = "Statistics" 

Sheets(sheetName).Cells(2, l).FormulaRlCl = "Mean" 

Sheets(sheetNameicells(2, 2).Formula = "=AVERAGE("' & datasheet & "'!A2:A" & numRows & ")" 
Sheets(sheetName).Cells(3, 1) = "Stdev" 

Sheets(sheetNameicells(3, 2).Formula = "=STDEV("' & datasheet & "'!A2:A" & numRows & ")" 
Sheets(sheetName).Cells(4, 1) = "Sum" 

Sheets(sheetNameicells(4, 2).Formula = "=SUM("' & datasheet & "'!A2:A" & numRows & ")" 

Sheets(sheetName).Cells(5, 1) = "Count" 

Sheets(sheetName).Cells(5, 2).Formula = numRows - 1 

Sheets(sheetName).Cells(2, 5).Formula = "Calculate Bands" 

Sheets(sheetName).Cells(3, 5).Formula = "Band D" 

Sheets(sheetName).Cells(3, 6).Formula = "Band C" 

Sheets(sheetName).Cells(3, 7).Formula = "Band B" 

Sheets(sheetName).Cells(3, 8).Formula = "Band A" 

Sheets(sheetName).Cells(4, 4).Formula = "from" 

Sheets(sheetName).Cells(5, 4).Formula = "to" 

Sheets(sheetName).Cells(4, 5).Formula = "0" 'Band D from 

Sheets(sheetName).Cells(5, 5) = "=ROUND(B2+B3,3)" 'band d to 

Sheets(sheetName).Cells(4, 6) = "=ROlJND(B2+B3,3)" 'band c from 

Sheets(sheetNameicells(5, 6) = "=ROlJND(B2+B3*2,3)" 'band c to 

Sheets(sheetName).Cells(4, 7) = "=ROlJND(B2+B3*2,3)" 'band b from 

Sheets(sheetName).Cells(5, 7) = "=ROUND(B2+B3*3,3)" 'band b to 

Sheets(sheetName).Cells(4, 8) = "=ROlJND(B2+B3*3,3)" 'band a from 

'bin labels 

Sheets(sheetName).Cells(7, 1) = "Bin" 

Sheets(sheetName).Cells(7, 2) = "Frequency" 

counter = 1 

For i = 1 To Round(Sheets(sheetName).Cells(5, 5).Value, 0) 'get bin values for band A 
Sheets(sheetName).Cells(7 + i, 1) = i 

Sheets(sheetNameicells(7 + i, 2) = "=COlJNTIF('" & datasheet & '"!A2:A" & numRows & ", & 

i &.)" 

counter = counter + 1 
Next i 

Sheets(sheetName).Cells(7 + counter, 1) = Sheets(sheetName).Cells(5, 6) 'put in next bin (band c end) 
Sheets(sheetName).Cells(l, 9).Formula = "=COUNTIF('" & datasheet & '"!A2:A" & numRows & ", 

""<" & Sheets(sheetName).Cells(5, 6) &.)" 

Sheets(sheetName).Cells(l, 10).Formula = "=COUNTIF('" & datasheet & '"!A2:A" & numRows & ", 

""<" & Sheets(sheetName).Cells(4, 6) &.)" 

Sheets(sheetName).Cells(7 + counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 
counter = counter + 1 

Sheets(sheetName).Cells(7 + counter, 1) = Sheets(sheetName).Cells(5, 7) 'bandb end 
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Sheets(sheetName).Cells(l, 9).Formula = "=COUNTIF("' & datasheet & "'!A2:A" & numRows & ", 

& Sheets(sheetName).Cells(5, 7) &.)" 

Sheets(sheetName).Cells(l, 10).Formula = "=COUNTIF("' & datasheet & "'!A2:A" & numRows & ", 

& Sheets(sheetName).Cells(4, 7) &.)" 

Sheets(sheetName).Cells(7 + counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 
counter = counter + 1 

exitif = False 

If Sheets(sheetName).Cells(5, 7) < 15 Then 'add more bins 15-30... 

Sheets(sheetName).Cells(7 -i- counter, 1) = "15" 

Sheets(sheetName).Cells(l, 9).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & ", 

Sheets(sheetName).Cells(l, 10).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & ", 
& Sheets(sheetName).Cells(4, 8) &.)" 

Sheets(sheetName).Cells(7 -i- counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 

If Sheets(sheetName).Cells(7 -i- counter, 2) = 0 Then 

Sheets(sheetName).CeIls(7 -i- counter, I) = "> " & Sheets(sheetName).Cells(4, 8) 
Sheets(sheetNameiceIls(7 counter, 2).FormuIa = "=COlJNTIF("' & datasheet & "'!A2:A" & 

numRows & ", & Sheets(sheetName).CeIIs(4, 8) &.)" 

exitif = True 
End If 

counter = counter -I- I 
If exitif = False Then 

Sheets(sheetName).Cells(7 -i- counter, 1) = "20" 

Sheets(sheetNameicells(l, 9).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

tl 20"")" 

Sheets(sheetName).Cells(l, 10).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 
" 16"")" 

Sheets(sheetName).Cells(7 -i- counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 

If Sheets(sheetName).Cells(7 -i- counter, 2) = 0 Then 
Sheets(sheetName).Cells(7 -i- counter, 1) = "> 15" 

Sheets(sheetNamelcells(7 counter, 2lFormula = "=COlJNTIF("' & datasheet & "'!A2:A" & 
numRows & ", "">= 15"")" 
exitif = Tme 
End If 

End If' exitif 
counter = counter -I- 1 

If exitif = False Then 

Sheets(sheetName).Cells(7 -i- counter, 1) = "25" 

Sheets(sheetNamelcells(l, 9).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

tl Mii^_ 25"")" 

Sheets(sheetName).Cells(l, 10).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 
" ""< 21 "")" 

Sheets(sheetName).Cells(7 -i- counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 

If Sheets(sheetName).Cells(7 -i- counter, 2) = 0 Then 
Sheets(sheetName).Cells(7 -i- counter, 1) = "> 20" 

Sheets(sheetNamelcells(7 counter, 2lFormula = "=COlJNTIF("' & datasheet & "'!A2:A" & 
numRows & ", "">= 20"")" 
exitif = Tme 
End If 
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End If' exitif 
counter = counter + 1 

If exitif = False Then 

Sheets(sheetName).Cells(7 + counter, 1) = "30" 

Sheets(sheetNameicells(l, 9).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

Sheets(sheetName).Cells(l, 10).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 
", ""<26"")" 

Sheets(sheetName).Cells(7 + counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 

If Sheets(sheetName).Cells(7 + counter, 2) = 0 Then 
Sheets(sheetName).Cells(7 + counter, 1) = "> 25" 

Sheets(sheetNameicells(7 + counter, 2).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & 
numRows & ", "">= 25"")" 
exitif = Tme 
End If 

End If' exitif 
counter = counter + 1 

If exitif = False Then 

Sheets(sheetName).Cells(7 + counter, 1) = "> 30" 

Sheets(sheetName).Cells(7 + counter, 2) = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

ti 30"")" 

End If 
Else 

If exitif = False Then 

Sheets(sheetName).Cells(7 + counter, 1) = "20" 

Sheets(sheetNameicells(l, 9).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

tl 20"")" 

Sheets(sheetName).Cells(l, 10).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

tl Mii^_ 

Sheets(sheetName).Cells(7 + counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 

If Sheets(sheetName).Cells(7 + counter, 2) = 0 Then 
Sheets(sheetName).Cells(7 + counter, 1) = "> 15" 

Sheets(sheetNameicells(7 + counter, 2).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & 
numRows & ", "">= 15"")" 
exitif = Tme 
End If 

End If' exitif 
counter = counter + 1 

If exitif = False Then 

Sheets(sheetName).Cells(7 + counter, 1) = "25" 

Sheets(sheetNameicells(l, 9).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

II Mii^_ 25"")" 

Sheets(sheetName).Cells(l, 10).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

II Mii^_ 20"")" 

Sheets(sheetName).Cells(7 + counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 

If Sheets(sheetName).Cells(7 + counter, 2) = 0 Then 
Sheets(sheetName).Cells(7 + counter, 1) = "> 20" 

Sheets(sheetNameicells(7 + counter, 2).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & 
numRows & ", "">= 20"")" 
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exitif = True 
End If 

End If' exitif 
counter = counter + I 

If exitif = False Then 

Sheets(sheetName).Cells(7 + counter, 1) = "30" 

Sheets(sheetNameicells(l, 9).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 
Sheets(sheetName).Cells(l, 10).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

ti 

Sheets(sheetName).Cells(7 + counter, 2) = Abs(Sheets(sheetName).Cells(l, 9) - 
Sheets(sheetName).Cells(l, 10)) 

If Sheets(sheetName).Cells(7 + counter, 2) = 0 Then 
Sheets(sheetName).Cells(7 + counter, 1) = "> 25" 

Sheets(sheetNameicells(7 + counter, 2).Formula = "=COlJNTIF("' & datasheet & "'!A2:A" & 
numRows & ", "">= 25"")" 
exitif = Tme 
End If 

End If' exitif 
counter = counter + 1 

If exitif = False Then 

Sheets(sheetName).Cells(7 + counter, 1) = "> 30" 

Sheets(sheetName).Cells(7 + counter, 2) = "=COlJNTIF("' & datasheet & "'!A2:A" & numRows & 

ti 30"")" 

End If 
End If 

Call formatSheetForPrint 
End Sub 


Sub: CopyMathCadObj 
Author: Matt Behnke 
Created: 12/5/01 

Description: copies the mathcad onject, for running a curve fit., 
inputs: 

Outputs: 


Sub CopyMathCadObj 0 

Windows("AffiliationMacro.xls").Activate 
Sheets("Mathcad"). Select 

Sheets("Mathcad").Copy Before:=Workbooks(currFilename).Sheets(dataSheet) 
' If ActiveSheet.Name = "Mathcad" Then 
' ActiveSheet.Name = "Mathcad " & band 
' Else 

' MsgBox ("Mathcad sheet rename failed") 

' End If 

End Sub 


' Sub: CopyDistributionGraph 
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Author: Matt Behnke 
Created: 11/5/01 

Deseription: copies the distribution graph from the macro sheet into the data spreadsheet 
inputs: 

Outputs: 


Sub CopyDistributionGraph() 

Application.Display Alerts = False 

numSheets = Sheets.Count 

Windows("AffiliationMacro.xls"). Activate 
Sheets("Affiliation Distribution Sample").Select 

Sheets("Affiliation Distribution Sample").Copy After:=Workbooks(currFilename).Sheets(numSheets) 
Sheets("Affiliation Distribution Sample").Copy After:=Workbooks(currFilename).Sheets(numSheets) 

ActiveChart.SeriesCollection(l). Select 

ActiveChart.SeriesCollection(l).XValues = '-Distribution!R8Cl:R18Cl" 
ActiveChart.SeriesCollection(l). Values = '-Distribution! R8C2:R19C2" 

ActiveChart.ChartTitle.Characters.Text = "Productivity Distribution" & Chr(lO) _ 

& technologyName & " (" & stepinterval & ")" 

Application.Display Alerts = True 

End Sub 


' Sub: ComputeEntropy 
' Author: Matt Behnke 
' Created: 1/28/02 

' Description: Computes the cumulative entropy using the supplied datasheets 
' note number of instances must begin at row 2, column 4.. 

' inputs: datasheet - matrix of the descriptorData.. Y-axis is the terms, X-axis is timesteps, v is # of 
instances 

' time 1, 2, 3, 4. 

' terml v v v 

' term2 v 

' outSheet: name of the sheet that contains the computed entropy. 

' theType: 1) s(x|y), 2) s(y|x) 

' Outputs: 


Sub ComputeEntropy(ByVal datasheet As String, ByVal outSheet As String, ByVal theType As Integer) 

numCols = CountCols(dataSheet, 1) 
numRows = CountRows(dataSheet, 1) 

Sheets.Add 

'Sheets(Sheets.Count). Select 
ActiveSheet.Name = outSheet 

Worksheets(outSheet).Move Afler:=Worksheets(dataSheet) 


For i = 1 To numCols 


217 









If i >= 4 And theType = 1 Then 
TotalNumlnstances = Sheets(dataSheet).Cells(numRows + 1, i) 
Elself i >= 4 And theType = 2 Then 

TotalNumlnstanees = Sheets(dataSheet).Cells(numRows + 1,4) 
End If 

For j = 1 To numRows 


If i >= 4 And j >= 2 Then 

numlnstanees = Sheets(dataSheet).Cells(j, i) 

If Not numlnstanees = "" And numlnstanees > 0 Then ' Then 

entropy = -numlnstanees / TotalNumlnstanees * (Log(numInstanees / TotalNumlnstanees) / 

Log(2)) 


Sheets(outSheet).Cells(j, i) = entropy 
End If 

If j = numRows Then 'put in sum of entropy 

Sheets(outSheet).CellsG + 1, i) = "=SUM(" & eol(i) & "2:" & eol(i) & numRows & ")" 
End If 

Else 'eopy terms, eount, first pub date 

Sheets(outSheet).CellsG, i).Value = Sheets(dataSheet).CellsG, i).Value 
End If 
Next) 

Next i 

End Sub 


' Sub: CreateDeseriptorDataY 
' Author: Matt Behnke 
' Created: 1/28/02 

' Deseription: Takes the supplied deseriptor data sheet and ereates the Y part of the (X,Y) world 
' as X inereases y deereases.. a value deereases on the y sheet when a value inereases on the y sheet 

' inputs: datasheet - matrix of the deseriptorData.. Y-axis is the terms, X-axis is timesteps, v is # of 
instanees 

' time 1, 2, 3, 4. 

' terml v v v 

' term2 v 

' outSheet: name of the sheet that eontains DeseriptorDataY 
' Outputs: 


Sub CreateDeseriptorDataY(ByVal datasheet As String, ByVal outSheet As String) 

numCols = CountCols(dataSheet, 1) 
numRows = CountRows(dataSheet, 1) 

Worksheets(dataSheet).Copy After:=Worksheets(dataSheet) 

Sheets(dataSheet & " (2)").Seleet 
AetiveSheet.Name = outSheet 

For i = 2 To numRows 

For j = 4 To numCols 
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numTotallnstances = Sheets(outSheet).Cells(i, 2) 
numlnstances = Sheets(outSheet).Cells(i, j) 


If j = 4 And Sheets(outSheet).Cells(i, j) > 0 Then 'places the initial value at the end.. 

lastColumn = Sheets(outSheet).Cells(i, j) 

End If 

Sheets(outSheet).Cells(i, j) = numTotallnstances - numlnstances 
If j = numCols And lastColumn > 0 Then 

Sheets(outSheet).Cells(i, j) = lastColumn 'places the value of first column x into last coin Y. 
End If 

If i = numRows Then 'put in sum 

Sheets(outSheet).Cells(i + 1, j) = "=SUM(" & colG) & "2:" & colQ) & numRows & ")" 

End If 
Next j 

lastColumn = 0 
Next i 

End Sub 'CreateDescriptorDataY 


Sub computeEntropyTestO 
'IT WORKS 

'Call ComputeEntropyC'descriptor data X", "World Cumulative Entropy X", 1) 
Call ComputeEntropy("descriptor_data_Y", "World Cumulative Entropy Y", 2) 
'Call CreateDescriptorDataY("descriptor data X", "descriptor data Y") 

End Sub 


Sub: FillBand 
Author: Matt Behnke 
Created: 11/7/01 

Description: fills in a bands distribution by copying a row from the list of all the affiliations (datasheet) 
inputs: band name 

Outputs: num of rows copied 


Function FillBand(ByVal band As String) As Integer 
numRowsCopied = 0 

Sheets .Add After:=Worksheets(Worksheets.Count) 
currsheetName = ActiveSheet.Name 
numRows = CountRows(dataSheet, 1) 
'Sheets(Worksheets.Count). Select 

Columns("C:C").ColumnWidth = 62.43 

Select Case band 
Case "A Band" 

bandFrom = Sheets("Distribution").Cells(4, 8) 
bandTo = 32500 
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Case "B Band" 

bandFrom = Sheets("Distribution").Cells(4, 7) 
bandTo = Sheets("Distributioii").Cells(5, 7) 

Case "C_Band" 

bandFrom = Sheets("Distribution").Cells(4, 6) 
bandTo = Sheets("Distribution").Cells(5, 6) 

Case "D Band" 

bandFrom = Sheets("Distribution").Cells(4, 5) 
bandTo = Sheets("Distribution").Cells(5, 5) 

End Select 

SheetsC" & datasheet & "").Select 
Rows("l:l").Select 
Selection.Copy 
Sheets(currsheetName). Select 
Rows("l:l").Select 
ActiveSheet.Paste 

counter = 2 

For i = 2 To numRows 'copy rows from datasheet into band 

If Sheets(dataSheet).Cells(i, 1) >= bandFrom And Sheets(dataSheet).Cells(i, 1) <= bandTo Then 
Sheets(dataSheet). Select 
Rows(i & & i).Select 

Selection.Copy 
Sheets(currsheetName). Select 
Rows(counter & & counter).Select 

Active Sheet. Paste 
counter = counter + 1 
numRowsCopied = numRowsCopied + 1 
End If 
Next i 

uumRowsInBand = CountRows(currsheetName, 1) 
numColumns = CountCols(currsheetName, 1) 'num time steps 

Cells(numRowsInBand +1,3) = "Count" 

Cells(numRowsInBand + 2, 3) = "Mean" 

Cells(numRowsInBand + 3,3) = "Std Dev" 

Cells(numRowsInBand + 4, 3) = "Sum" 

For i = 4 To numColumns 'put in the mean and std deviation for each time step 
'put in zeros if nothing there 
' For j = 2 To numRowsInBand 
' Ifi=4Then 

' If CellsQ, i) > 0 Then 

' Else 

' Cells(j, i) = 0 

' End If 

' Else 

' Cells(j, i) = Cells(j, i) + Cells(j, i - 1) 

' End If 

' Next) 

'dont put in zeros if nothing there 
' For j = 2 To numRowsInBand 

' If (CellsG, i) > 0 And i > 4) Or (i > 4 And CellsG, i - 1) > 0) Then 
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' Cells(j, i) = Cells(j, i) + Cells(j, i - 1) 

' End If 
' Nextj 

Cells(numRowsInBand + 4, i).Formula = '-Sum(" & col(i) & "2:" & col(i) & uumRowsInBand & ")" 
Cells(numRowsInBand + 1, i).Formula = "=Countif(" & col(i) & "2:" & col(i) & uumRowsInBand & 

If Cells(numRowsInBand + 1, i) > 0 Then 

Cells(numRowsInBand + 2, i).Formula = "=AVERAGE(" & col(i) & "2:" & col(i) & 
uumRowsInBand & ")" 

If Cells(numRowsInBand + 1, i) > 1 Then 'more than one so comput std deviation 
Cells(numRowsInBand + 3, i).Formula = "=STDEV(" & col(i) & "2:" & col(i) & 
uumRowsInBand & ")" 

End If 
End If 
Next i 

ActiveSheet.Name = "Affdiation Cum Dist" & band 
'Call formatSheetForPrint 
FillBand = uumRowsCopied 

End Function 


Sub: FillBand Stats 
Author: Matt Behnke 
Created: 11/7/01 
revised: 12/3/01 

Description: creates a band's statistics sheet 
inputs: band name 

Outputs: 


Sub FillBandStats(ByVal band As String) 

Dim data As Variant 

If band = "World" Then 
source = datasheet 
Else 

source = "Affiliation Cum Dist" & band 
End If 

uumRowsInBand = CountRows(source, 1) 
numTimeStepsInBand = CountCols(source, 1) - 3 

Sheets.Add 

sheetName = band & " Stats" 

ActiveSheet.Name = sheetName '"" & band & " Stats" 

Sheets(sheetName).Move After:=Sheets(source) 

' Sheets(Worksheets.Count).Select 
'Columns("C:C").ColumnWidth = 62.43 

Sheets(sheetName).Cells(5, 1) = " " 
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Sheets(sheetName).Cells(6, 1) = " " 
Sheets(sheetName).Cells(7, 1) = " " 
Sheets(sheetName).Cells(8, 1) = " " 
Sheets(sheetName).Cells(9, 1) = " " 
Sheets(sheetName).Cells(ll, 1) = " " 
Sheets(sheetName).Cells(12, 1) = " " 


Sheets(sheetName).Cells(l, 1) = "Curve fit y(t) y(t) = bCm" 
Sheets(sheetName).Cells(3, 1) = "b" 

Sheets(sheetName).Cells(4, 1) = "m" 

Sheets(sheetName).Cells(8, 3) = "Total Production" 
Sheets(sheetName).Cells(8, 6) = "Production/Step (on Average)" 
Sheets(sheetName).Cells(8, 11) = "Calculated Production/Step)" 

Sheets(sheetName).Cells(10, 1) = "Time Step" 
Sheets(sheetName).Cells(10, 2) = "Step Name" 

Sheets(sheetName).Cells(10, 3) = "Mean" 
Sheets(sheetName).Cells(10, 4) = "Std Deviation" 
Sheets(sheetName).Cells(10, 5) = "'+ 3 sigma" 

'average per step 

Sheets(sheetName).Cells(10, 6) = "Mean" 
Sheets(sheetName).Cells(10, 7) = "Std Deviation" 
Sheets(sheetName).Cells(10, 8) = "'+ 3 sigma" 

Sheets(sheetName).Cells(10, 9) = "Total Prod" 
Sheets(sheetName).Cells(10, 10) = "kappa" 
Sheets(sheetName).Cells(10, 11) = "kappa/2" 
Sheets(sheetName).Cells(10, 12) = "rvalue" 

Sheets(sheetName).Cells(10, 13) = "Mean" 
Sheets(sheetNameicells(10, 14) = "R^2" 
Sheets(sheetName).Cells(10, 15) = "'+ 3 sigma" 


Sheets(sheetName).Cells(13, 1) = "0" 'time step zero 

For i = 1 To numTimeStepsInBand 
SheetsC" & band & "_Stats").Select 
Sheets(sheetName).Cells(13 + i, 1) = i 'step number 

Sheets(sheetName).Cells(13 + i, 2) = Sheets(source).Cells(l, i + 3) 'step name 
'Total Production 

Sheets(sheetName).Cells(13 + i, 3) = 0.000001 + Sheets(source).Cells(numRowsInBand + 2, i + 
3).Value 'mean 

Sheets(sheetName).Cells(13 + i, 4) = 0.000001 + Sheets(source).Cells(numRowsInBand + 3, i + 
3).Value 'stdev 

Sheets(sheetName).Cells(13 + i, 5) = "=C" & 13 + i & "+ 3*D" & 13 + i 'sheets(sheetName).Cells(13 + 
i, 4) 'mean + 3 std 

'Production per step on avg.. 

If i = 1 Then 

'Cells(13 + i, 6) =sheets(sheetName).Cells(13 + i, 3) /sheets(sheetName).Cells(13 + i, 1) 'mean 

Sheets(sheetName).Cells(13 + i, 6) = "=C" & 13 + i 'Cells(13 + i, 3) 

Sheets(sheetName).Cells(13 + i, 7) = "=D" & 13 + i 'Cells(13 + i, 4) 'stdev 
Sheets(sheetName).Cells(13 + i, 8) = "=E" & 13 + i 'sheets(sheetName).Cells(13 + i, 5) 'mean * 3std 
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Else 

Sheets(sheetName).Cells(13 + i, 6) = "=C" & 13 + i & " - C" & 12 + i 'Cells(13 + i, 3) - 
slieets(slieetName).Cells(12 + i, 3) 

Slieets(slieetName).Cells(13 + i, 7) = "=D" & 13 + i & " - D" & 12 + i 'Cells(13 + i, 4) - 
slieets(slieetName).Cells(12 + i, 4) 'stdev 

Slieets(slieetName).Cells(13 + i, 8) = "=E" & 13 + i & " - E" & 12 + i 'Cells(13 + i, 5) - 
sheets(sheetName).Cells(12 + i, 5) 'mean + 3std 
End If 

If i = numTimeStepsInBand Then 'put in average 
Sheets(sheetName).Cells(14 + i, 8) = "=AVERAGE(H14:H" & i + 13 & ")" 

End If 

If i = 1 Then 

Sheets(sheetName).Cells(13 + i, 9) = "=H" & 13 + i 'Cells(13 + i, 8) 

Else 

Sheets(sheetName).Cells(13 + i, 9) = "=H" & 13 + i & " + H" & 12 + i 'Cells(13 + i, 8) 
+sheets(sheetName).Cells(12 + i, 9) 

End If 

Sheets(sheetName).Cells(13 + i, 10) = "=H" & 14 + numTimeStepsInBand 'avg ofmean*3std 
Next i 

'AetiveSheet.Name = "" & band & " Stats" 

Call eopyStatGraphs(numTimeStepsInBand, band, "" & band & " Stats") 

SheetsC" & band & "_Stats").Seleet 

'get formula of trendline from entropy power trend graph 
trendEq = Sheets(sheetName).Cells(2, 1) 

Sheets(sheetName).Cells(3, 2) = firstPartTrendEq(trendEq) 

Sheets(sheetName).Cells(4, 2) = seeondPartTrendEq(trendEq) 

'get kappa, r. p 

Sheets(sheetName).Cells(3, 3) = "kappa" 'headers 
Sheets(sheetName).Cells(4, 3) = "r" 

Sheets(sheetName).Cells(5, 3) = "p" 

Sheets(sheetName).Cells(6, 3) = "1-Sum(r^2)" 

j = 14 'get the start row 

' While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
' i =sheets(sheetName).Cells(j, 3).Value 

' If Not (i > 0) Then 

j=j + l 

' End If 

' Wend 

'perform linear interpolation on eolumns C and D 
'datasheet = band & " Stats" 

Call FillIn]VIissingData(sheetName, 3, 14, numTimeStepsInBand + 13) 

Call FillIn]VIissingData(sheetName, 4, 14, numTimeStepsInBand + 13) 

SheetsC" & band & "_Stats").Seleet 

'j = j + 1 'add one to the starting row to not inelude the first time step.... 
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numRowsToUse = numTimeStepsInBand - (j - 14) 

data = Update_Mathcad_Band_Stats("Mathcad", ActiveSheet.Name, "C" & j, "F" & j, numRowsToUse - 

1 , 0 , 0 . 001 ) 

kappa = data(l) 
r = data(2) 
p = data(3) 
r2a = data(4) 

'put on sheet 

Sheets(sheetName).Cells(3, 4) = Round(kappa, 4) 

Sheets(sheetName).Cells(4, 4) = Rounder, 4) 

Sheets(sheetName).Cells(5, 4) = Round(p, 4) 

Sheets(sheetName).Cells(6, 5) = Round(r2a, 4) 

'ealeulate preditieded means for -1, -2 under total 
'Cells(l 1, 3).Formula = "=-$B$3*-Al U$B$4" 'not needed 
'Cells(12, 3>Formula = "=-$B$3*-A12^$B$4" 

'Cells(13, 3) = 0 


For i = 1 To numTimeStepsInBand 
'fdl in kappa, kappa/2, rvalue 
Sheets(sheetName).Cells(13 + i, 10) = "=$D$3" 

Sheets(sheetName).Cells(13 + i, 11) = "=$D$3 / 2" 

Sheets(sheetName).Cells(13 + i, 12) = "=$D$4" 

'fdl in ealeulated prod per step 

Sheets(sheetName).Cells(13 + i, 13).Formula = "=$D$3*(C" & 13 + i & "+$D$5)/(C" & 13 + i & 
"+$D$4+$D$5)" 'mean 

Sheets(sheetName).Cells(13 + i, 14) = "=(M" & 13 + i & "-F" & 13 + i & ")*(M" & 13 + i & "-F" & 
13 + i & ")" 'R^2 

sumRSquared = Sheets(sheetName).Cells(13 + i, 14) + sumRSquared 
Sheets(sheetName).Cells(13 + i, 15) = "=$D$3*(E" & 13 + i & "+$D$5)/(E" & 13 + i & 
"+$D$4+$D$5)" 

If i = numTimeStepsInBand Then 'put in average R^2-REMOVE AFTER dbl Cheeking 

values. 

Sheets(sheetName).Cells(15 + i, 10) = "Sum(R^2)" 

Sheets(sheetNameicells(15 + i, 12) = "=Sum(N14:N" & i + 13 & ")" 

Sheets(sheetName).Cells(16 + i, 12) = "=1-L" & i + 15 
End If 

Next i 

inverseRSquared = (1 - sumRSquared) 'from the sum of r squared 
Sheets(sheetName).Cells(6, 4) = Round(inverseRSquared, 4) '4deeimal plaees 

' sheets(sheetName).Cells(ll, ll).Formula = "=$D$3*(C11+$D$5)/(C11+$D$4+$D$5)" -removed (-2, - 
1, 0 time steps of ealeulated mean) 

' sheets(sheetName).Cells(12, ll).Formula = "=$D$3*(C12-K$D$5)/(C12-K$D$4-K$D$5)" 

' sheets(sheetNameicells(13, 11) = "=$D$3*(C13-^$D$5)/(C13-^$D$4-K$D$5)" 

'Call formatSheetForPrint 

Call eopyLeamingCum(numTimeStepsInBand, band, "" & band & " Stats") 'learning vs. eum 
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End Sub 'fill stats 


Sub: copyStatGraphs 
Author: Matt Behnke 
Created: 11/7/01 

Deseription: eopyies the affilaition statisties graphs 
inputs: 

Outputs: 


Sub eopyStatGraphs(ByVal timeSteps As Integer, ByVal band As String, ByVal souree As String) 

Applieation.DisplayAlerts = False 

numSheets = Sheets.Count 

Windows("AffiliationMaero.xls").Aetivate 

Sheets("A_Band_Leaming_Cap_per_k").Seleet 

Sheets("A_Band_Leaming_Cap_per_k").Copy After:=Workbooks(currFilename).Sheets(numSheets) 
AetiveChart.SeriesColleetion(l).Seleet 

j = 14 

While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets(souree).Cells(j, 3).Value 
If Not (i > 0) Then 

j=j + l 

End If 
Wend 

AetiveChart.SeriesColleetion(l).Values = "=" & souree & "!R" & j & "C6:R" & timeSteps + 13 & "C6" 

AetiveChart.SeriesColleetion(2).Values = & souree & "!R" & j + 1 & "C8:R" & timeSteps + 13 & 

"C8" 

AetiveChart.ChartTitle.Charaeters.Text = "" & band & " Produetivity Index (Cum over k)" & Chr(lO) _ 
& teehnologyName & " (" & stepinterval & ")" 

ActiveSheet.Name = "" & band & "_Leaming_Cap_per_k" 

AetiveChart.SeriesColleetion(l).ErrorBars.Seleet 
ExeeuteExeeMMaero _ 

"ERRORBAR.Y(2,5,""=" & souree & "!R" & j & "C7:R" & timeSteps + 13 & 
"C7"",""=A_Band_Stats!$F$" & j & ":$F$" & timeSteps + 13 &.)" 

'move legend and textbox 
AetiveChart.Legend. Seleet 
Seleetion.Left = 431 
Seleetion.Top = 341 

'eopy seeond graph 

Windows("AffiliationMaero.xls").Aetivate 
Sheets("A_Band_Leaming_Cum").Seleet 

Sheets("A_Band_Leaming_Cum").Copy After:=Workbooks(eurrFilename).Sheets(numSheets) 
AetiveChart. SeriesColleetion( 1). Seleet 


225 






ActiveChart.SeriesCollection(l).XValues = & source & "!R" & j & "C1:R" & timeSteps + 13 & 

"Cl" 

ActiveChart.SeriesCollection(l).Values = & source & "!R" & j & "C3:R" & timeSteps + 13 & "C3" 

ActiveChart.SeriesCollection(2).XValues = & source & "!R" & j & "C1:R" & timeSteps + 13 & 

"Cl" 

ActiveChart.SeriesCollection(2).Values = & source & "!R" & j & "C5:R" & timeSteps + 13 & "C5" 


ActiveChart. SeriesCollection( 1). Select 
With ActiveChart. SeriesCollection( 1 ).Trendlines( 1) 

'put trendline equation onto stats sheet 
Worksheets(source).Cells(2, 1).Value = .DataLabel.Text 
•DisplayRSquared = Trae 
End With 


ActiveChart.ChartTitle.Characters.Text = "" & band & " Productivity In Pubs (Cum over k)" & Chr(lO) 

& technologyName & " (" & stepinterval & ")" 

ActiveSheet.Name = "" & band & " Leaming Cum" 

Application.DisplayAlerts = True 

End Sub 

' copies the learning vs cumulative chart. 


Sub copyLeamingCum(ByVal timeSteps As Integer, ByVal band As String, ByVal source As String) 

Application.DisplayAlerts = False 

kappa = Sheets(source).Cells(3, 4) 
r = Sheets(source).Cells(4,4) 
p = Sheets(source).Cells(5, 4) 
r2 = Sheets(source).Cells(6, 4) 

numSheets = Sheets.Count 

WindowsC'AffiliationMacro.xls"). Activate 
Sheets("A_Band_Leaming_Vs_Cum"). Select 

Sheets("A_Band_Leaming_Vs_Cum").Copy After~Workbooks(currFilename _ 

). Sheets(numSheets) 

ActiveChart.PlotArea. Select 


j = 14 

While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets(source).Cells(j, 3).Value 
If Not (i > 0) Then 

j=j + l 

End If 
Wend 
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ActiveChart.SeriesCollectioii(l).XValues = & source & "!R" & j & "C3:R" & timeSteps + 13 & 

"C3" 

ActiveChart.SeriesCollectioii(l).Values = & source & "!R" & j & "C6:R" & timeSteps + 13 & "C6" 

ActiveChart.SeriesCollection(2).XValues = & source & "!R" & j & "C3:R" & timeSteps + 13 & 

"C3" 

ActiveChart.SeriesCollection(2).Values = & source & "!R14C13:R" & timeSteps + 13 & "C13" 

'kappa 

ActiveChart.SeriesCollection(3).XValues = & source & "!R" & j & "C5:R" & timeSteps + 13 & 

"C5" 

ActiveChart.SeriesCollection(3).Values = & source & "!R14C10:R" & timeSteps + 13 & "CIO" 

'3 sigma 3 sigma 

ActiveChart.SeriesCollection(4).XValues = & source & "!R" & j & "C5:R" & timeSteps + 13 & 

"C5" 'E 

ActiveChart.SeriesCollection(4).Values = & source & "!R14C8:R" & timeSteps + 13 & "C8" 'H 

'kappa/2 

ActiveChart.SeriesCollection(5).XValues = & source & "!R" & j & "C5:R" & timeSteps + 13 & 

"C5" 

ActiveChart.SeriesCollection(5).Values = & source & "!R14C11:R" & timeSteps + 13 & "Cll" 


'r-p 

ActiveChart.SeriesCollection(6).XValues = & source & "!R" & j & "C12:R" & timeSteps + 13 & 

"C12" 

ActiveCliart.SeriesCollection(6).Values = & source & "!R14C6:R" & timeSteps + 13 & "C6" 

' Acti veChart. SeriesCollection( 1) .ErrorBars .Select 
' ExecuteExceWMacro _ 

' "ERRORBAR.Y(2,5,""=" & source & "!R" & j & "C7:R" & timeSteps + 13 & 

"C7"",""=A_Band_Stats!$F$" & j & ":$F$" & timeSteps + 13 &.)" 

ActiveChart.ShapesC'Text Box 6").Select 

Selection.Characters.Text = "K= " & kappa & Clir(lO) & "r= " & r & Chr(lO) & "p= " & p & Clir(lO) & 
"" & Chr(lO) & "R2= " & r2 & Chr(lO) & "" & Clir(lO) & "" 

ActiveChart.ChartTitle.Characters.Text = "Learning Curve — " & band & " (Mean and Capacity)" & 
Chr(10)_ 

& technologyName & " (" & stepinterval & ")" 

ActiveSheet.Name = "" & band & " Leaming Vs Cum" 

Application.DisplayAlerts = True 

End Sub 


Sub: CopyABCDGraph 
Author: Matt Behnke 
Created: 11/7/01 

Description: copies the ABCD band mean graph and changes the dataseries to point to the right data., 
inputs: 

Outputs: 


Sub CopyABCDGraph() 
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Application. DisplayAlerts = False 

kappa = Slieets("D_Band_Stats").Cells(3, 4) 
r = Slieets("D_Band_Stats").Cells(4, 4) 
p = Slieets("D_Band_Stats").Cells(5, 4) 
r2 = Slieets("D_Band_Stats").Cells(6,4) 

Windows("Affdiation]Vlacro.xls").Activate 
Sheets("ABCD_Band_Leaming_Vs_Cum"). Select 
Slieets("ABCD_Band_Leaming_Vs_Cum").Copy After~Workbooks( _ 
currFilename).Slieets(Slieets.Count) 

ActiveChart.PlotArea. Select 
' ActiveChart.SeriesCollection(3).Delete 
currChartName = ActiveChart.Name 

timeSteps = CountCols("Affiliation_Cum_Dist_D_Band", 1) - 3 

If testBandA > 0 Then 

'aband 
j = 14 

While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets("A_Band_Stats").Cells(j, 3).Value 
If Not (i > 0) Then 

j=j + l 

End If 
Wend 

Charts(currChartName) .Select 
'aband mean 

ActiveChart.SeriesCollection(l).XValues = "=A_Band_Stats!R" & j & "C3:R" & timeSteps + 13 & "C3" 
ActiveChart.SeriesCollection(l).Values = "=A_Band_Stats!R" & j & "C6:R" & timeSteps + 13 & "C6" 

'calc y 

ActiveChart.SeriesCollection(2).XValues = "=A_Band_Stats!R" & j + 1 & "C3:R" & timeSteps + 13 & 
"C3" 

ActiveChart.SeriesCollection(2).Values = "=A_Band_Stats!R" & j + 1 & "C13:R" & timeSteps + 13 & 
"C13" 

'3sigma 3sigma 

ActiveChart.SeriesCollection(3).XValues = "=A_Band_Stats!R" & j + 1 & "C5:R" & timeSteps + 13 & 
"C5" 

ActiveChart.SeriesCollection(3). Values = "=A_Band_Stats!R" & j & "C15:R" & timeSteps + 13 & 

"C15" 

'aband kappa 

ActiveChart.SeriesCollection(7).XValues = "=A_Band_Stats!R" & j & "C5:R" & timeSteps + 13 & "C5" 
ActiveChart.SeriesCollection(7).Values = "=A_Band_Stats!R" & j & "C10:R" & timeSteps + 13 & 

"CIO" 

'aband 3sig 3sig 

ActiveChart.SeriesCollection(8).XValues = "=A_Band_Stats!R" & j & "C5:R" & timeSteps + 13 & "C5" 
ActiveChart.SeriesCollection(8).Values = "=A_Band_Stats!R" & timeSteps + 14 & "C8" 

'aband kappa /2 
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ActiveChart.SeriesCollection(9).XValues = "=A_Band_Stats!R" & j & "C5:R" & timeSteps + 13 & "C5" 
ActiveChart.SeriesCollection(9iValues = "=A_Band_Stats!R14Cll:R" & timeSteps + 13 & "Cll" 

'aband r-p 

AetiveChart.SeriesColleetion(10).XValues = '-A Band StatsIR" & j & "C12:R" & timeSteps + 13 & 
"C12" 

AetiveChart.SeriesColleetion(10).Values = "=A_Band_Stats!R14C6:R" & timeSteps + 13 & "C6" 

End If 'test band a > 0 

'AetiveChart. SeriesColleetion( 1 ).ErrorBars. Seleet 
'ExeeuteExeel4Maero _ 

' "ERRORBAR.Y(2,5,""=A_Band_Stats!R" & j & "C7:R" & timeSteps + 13 & 
"C7"",""=A_Band_Stats!$F$" & j & ":$F$" & timeSteps + 13 &.)" 

'bband ***********tHIS IS CORRECT. 

If testBandB > 0 Then 
j = 14 

While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets("B_Band_Stats").CellsG, 3).Value 
If Not (i > 0) Then 

j=j + l 

End If 
Wend 

' AetiveChart.SeriesColleetion(4).XValues = "=B_Band_Stats!R" & j & "C3:R" & timeSteps + 13 & 

"C3" 

' AetiveChart.SeriesColleetion(4).Values = "=B_Band_Stats!R" & j & "C6:R" & timeSteps + 13 & "C6" 
Else 

' AetiveChart. SeriesColleetion(4).X Values = "=0" 

' AetiveChart. SeriesColleetion(4). Values = "=0" 

End If 'testbandB 


'eband 

If testBandC > 0 Then 
j = 14 

While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets("C_Band_Stats").CellsG, 3).Value 
If Not G > 0) Then 

j=j + l 

End If 
Wend 

AetiveChart.SeriesColleetion(5).XValues = "=C_Band_Stats!R" & j & "C3:R" & timeSteps + 13 & "C3" 
AetiveChart.SeriesColleetion(5). Values = "=C_Band_Stats!R" & j & "C6:R" & timeSteps + 13 & "C6" 
Else 

' AetiveChart.SeriesColleetion(5).XValues ="" 

' AetiveChart.SeriesColleetion(5).Values = "" 

End If 'testBandC 

'dband 

If testBandD > 0 Then 
j = 14 
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While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets("D_Band_Stats").CellsG, 3).Value 
If Not (i > 0) Then 

j=j + l 

End If 
Wend 

' ActiveChart.SeriesCollection(6).XValues = "=D_Band_Stats!R" & j & "C3:R" & timeSteps + 13 & 

"C3" 

' ActiveChart.SeriesCollection(6).Values = "=D_Band_Stats!R" & j & "C6:R" & timeSteps + 13 & "C6" 
Else 

ActiveChart.SeriesCollection(6).XValues = "=0" 

ActiveChart.SeriesCollection(6).Values = "=0" 

End If 'test band d 

'kappa textbox 

' ActiveChart.Shapes("Text Box 7").Select 

' Selection.Characters.Text = "K= " & kappa & Chr(lO) & "r= " & r & Chr(lO) & "p= " & p & Chr(lO) & 
"" & Chr(lO) & "R2= " & r2 & Chr(lO) & "" & Chr(lO) & "" 

' ActiveChart.ChartTitle.Characters.Text = ActiveChart.ChartTitle.Characters.Text & Chr(lO) _ 

' & technologyName & " (" & stepinterval & ")" 

Application.DisplayAlerts = True 

End Sub 


Sub: copyBandSummaryGraphs 
Author: Matt Behnke 
Created: 12/13/01 

Description: Copies the band ENTROPY graphs and published messages summary graphs 
inputs: band name 

Outputs: 


Sub CopyBandSummaryGraphs(ByVal band As String) 

Application.DisplayAlerts = False 

If band = "World" Then 

source = "AffiliationSummary" 

Else 

source = "Affiliation Summary" & band 
End If 

numRows = CountRows(source, 1) 

'GRAPH ONE message_N_k+l vs N_k 

Windows("AffdiationMacro.xls").Activate 
Sheets("A_Band_Message_N_k+l vs N_k").Select 

Sheets("A_Band_Message_N_k+l vs N_k").Copy Before:=Workbooks(currFilename).Sheets(band & 
"_Stats") 
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ActiveChart.SeriesCollection(l). Select 
j =4 

While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets(source).Cells(j, 3).Value 
If Not (i > 0) Then 

j=j + l 

End If 
Wend 

If Sheets(source).Cells(i, 2).Characters(I, I).Text = "I" And Sheets(source).Cells(i, 2).Characters(2, 
l).Text ="/" Then 
yx=j 
Else 
yx=j 
End If 

ActiveChart.SeriesCollection(I).XVaIues = & source & "!R" & j & "C3:R" & numRows - I & "C3" 

ActiveChart.SeriesCollection(I). Values = & source & "!R" & j + I & "C3:R" & numRows & "C3" 

'y=x: 

ActiveChart.SeriesCollection(2).XVaIues = & source & "!R" & yx & "C3:R" & numRows & "C3" 

ActiveChart.SeriesCollection(2).Values = & source & "!R" & yx & "C3:R" & numRows & "C3" 

titleBefore = ActiveChart.ChartTitle.Characters.Text 

ActiveChart.ChartTitle.Characters.Text = band & " " & titleBefore & Chr(IO) _ 

& technologyName & " (" & steplnterval & ")" 

'place subscripts in the chart title (N_k+1, N_k) 

If band = "World" Then 

ActiveChart.ChartTitle. Select 
With Selection.Characters(Start~29, Length~3).Font 
.Subscript = True 
End With 

With Selection.Characters(Start~34, Length~l).Font 
.Subscript = True 
End With 
Else 

ActiveChart.ChartTitle. Select 
With Selection.Characters(Start~30, Length~3).Font 
.Subscript = True 
End With 

With Selection.Characters(Start~35, Length~l).Font 
.Subscript = True 
End With 
End If 

ActiveSheet.Name = "" & band & "_Message_N_k+I vs N_k" 

'copy second graph S_k+I vs S_k 
Windows("AffiIiation]VIacro.xIs").Activate 
Sheets("A_Band_WorId_S_k+I vs S_k").Select 

Sheets("A_Band_WorId_S_k+I vs S_k").Copy Before~Workbooks(currFiIename).Sheets(band & 
"_Stats") 

ActiveChart.SeriesCollection(I). Select 
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If band = "World" Then 

ActiveChart.SeriesCollection(l).XValues = '& source & "!R" & j & "C8:R" & numRows - 1 & 

"C8" 

ActiveChart.SeriesCollection(l).Values = '& source & "!R" & j + 1 & "C8:R" & numRows & 

"C8" 

'y=x: 

ActiveChart.SeriesCollection(2).XValues = '& source & "!R" & yx & "C8:R" & numRows & "C8" 
ActiveChart.SeriesCollection(2).Values = '& source & "!R" & yx & "C8:R" & numRows & "C8" 

Else 

ActiveChart.SeriesCollection(l).XValues = '& source & "!R" & j & "C6:R" & numRows - 1 & 

"C6" 

ActiveChart.SeriesCollection(l).Values = '& source & "!R" & j + 1 & "C6:R" & numRows & 

"C6" 

'y=x: 

ActiveChart.SeriesCollection(2).XValues = '& source & "!R" & yx & "C6:R" & numRows & "C6" 
ActiveChart.SeriesCollection(2).Values = '& source & "!R" & yx & "C6:R" & numRows & "C6" 
End If 

tideBefore = ActiveChart.ChartTide.Characters.Text 

ActiveChart.ChartTitle.Characters.Text = band & " " & titleBefore & Chr(lO) _ 

& technologyName & " (" & stepinterval & ")" 

Active Sheet.Name = & band & "_Entropy_S_k+l vs S_k" 

'place subscripts in the chart title (Entropy S_k+1, S_k) 

If band = "World" Then 

ActiveChart.ChartTitle. Select 

With Selection.Characters(Start~24, Length~3).Font 
.Subscript = True 
End With 

With Selection.Characters(Start~34, Length~l).Font 
.Subscript = True 
End With 
Else 

ActiveChart.ChartTitle. Select 
With Selection.Characters(Start~25, Length~3).Font 
.Subscript = True 
End With 

With Selection.Characters(Start~35, Length~l).Font 
.Subscript = True 
End With 
End If 


If band = "World" Then 
Else 

'copy third Graph S(Y)_k+l vs S world k 

Windows("AffdiationMacro.xls").Activate 
Sheets("A_Band_S(Y)_k+l Vs S_world_k").Select 

Sheets("A_Band_S(Y)_k+l Vs S_world_k").Copy Before~Workbooks(currFilename).Sheets(band & 
"_Stats") 

ActiveChart.SeriesCollection(l). Select 
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ActiveChart.SeriesCollection(l).XValues = '-Affiliation SummaryIR" & j & "C8:R" & numRows & 

"C8" 

ActiveChart.SeriesCollectioii(l).Values = '& source & "!R" & j + 1 & "C6:R" & numRows & 

"C6" 

'y=x 

ActiveChart.SeriesCollection(2).XValues = '-Affiliation SummaryIR" & yx & "C8:R" & numRows 
& "C8" 

ActiveChart.SeriesCollection(2).Values = '-Affiliation SummaryIR" & yx & "C8:R" & numRows & 

"C8" 

titlcBefore = ActiveChart.ChartTitle.Characters.Text 

ActiveChart.ChartTitle.Characters.Text = band & " " & titlcBefore & Chr(lO) _ 

& technologyName & " (" & stepinterval & ")" 

'subscripts in chart title 
ActiveChart.ChartTitle. Select 

With Selection.Characters(Start~24, Length~4).Font 

• Subscript = True 
End With 

With Selection.Characters(Start~33, Length~3).Font 

• Subscript = True 
End With 

With Selection^Characters(Start~39, Length~5)^Font 

• Subscript = True 
End With 

With Selection^Characters(Start~49, Length” l)^Font 

• Subscript = True 
End With 

ActiveSheet^Name = "" & band & "_S(X,Y)_k+l vs S world k" 

End If 

Application^DisplayAlerts = True 
End Sub 'copy summary band graphs 


Sub: FillBandAuthors 
Author: Matt Behnke 
Created: 11/7/01 

Description: fills in a bands author distribution by copying a row from the list of 
affilations with the number of authors as the matrix's values^ 
inputs: band name 

Outputs: 


Sub FillBandAuthors(ByVal band As String) 

Sheets^Add 

sheetName = "Aff Author Cum Dist" & band & "" 
ActiveSheet^Name = sheetName 
currsheetName = ActiveSheet^Name 

numRowsInBand = CountRows("Affihation_Cum_Dist_" & band, 1) 
numRowsInAuthors = CountRows("Affiliation_Authors", 1) 
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'Sheets(Worksheets.Count). Select 
Columns("C:C").ColumnWidth = 62.43 

Sheets(currsheetName).Move Before~Sheets("" & band & " Stats") 

Sheets("Affiliation_Authors").Select 

Rows("l:l").Select 

Selection.Copy 

Sheets(currslieetName).Select 

Rows("l:l").Select 

ActiveSheet.Paste 

counter = 2 

For i = 2 To numRowsInBand 'copy rows from datasheet into band 

affiliationName = Sheets("Affiliation_Cum_Dist_" & band).Cells(i, 3).Value 
For j = 2 To numRowsInAuthors 

If Sheets("Affiliation_Authors").Cells(j, 3).Value = affiliationName Then 
Sheets("Affihation_Authors"). Select 
Rows(j & &j).Select 

Selection.Copy 
Sheets(currsheetName). Select 
Rows(counter & & counter).Select 

Active Sheet. Paste 
counter = counter + 1 
End If 
Next) 

Next i 

numRowsInAuthorBand = CountRows(currsheetName, 1) 
numColumns = CountCols(currsheetName, 1) 'num time steps 

Sheets(sheetName).Cells(numRowsInAuthorBand +1,3) = "Count" 
Sheets(sheetName).Cells(numRowsInAuthorBand + 2, 3) = "Mean" 
Sheets(sheetName).Cells(numRowsInAuthorBand + 3,3) = "Std Dev" 
Sheets(sheetName).Cells(numRowsInAuthorBand + 4, 3) = "Sum" 

For i = 4 To numColumns 'put in the mean and std deviation for each time step 

' For j = 2 To numRowsInAuthorBand 

' If (Cells(j, i) > 0 And i > 4) Or (i > 4 Andsheets(sheetName).Cells(j, i - 1) > 0) Then 

' sheets(sheetName).Cells(j, i) =sheets(sheetName).Cells(j, i) +sheets(sheetName).Cells(j, i - 1) 

' End If 
' Next) 

Sheets(sheetName).Cells(numRowsInAuthorBand + 4, i) = "=Sum(" & col(i) & "2:" & col(i) & 
numRowsInAuthorBand & ")" 

'add count, avg, stdev... 

Sheets(sheetName).Cells(numRowsInAuthorBand + 1, i).Formula = "=Countif(" & col(i) & "2:" & 
col(i) & numRowsInAuthorBand & ", "">0"")" 

If Sheets(sheetName).Cells(numRowsInAuthorBand + 1, i) > 0 Then 
Sheets(sheetName).Cells(numRowsInAuthorBand + 2, i).Formula = "=AVERAGE(" & col(i) & "2:" 
& col(i) & numRowsInAuthorBand & ")" 

If Sheets(sheetName).Cells(numRowsInAuthorBand + 1, i) > 1 Then 'more than one so comput std 
deviation 
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Sheets(sheetName).Cells(numRowsInAuthorBand + 3, i).Formula = '-STDEV(" & col(i) & "2:" 
& col(i) & numRowsIuAuthorBand & ")" 

End If 
End If 
Next i 


'Call formatSheetForPrint 
End Sub 'band authors 


Sub: CalcCumulative 
Author: Matt Behnke 
Created: 11/15/01 

Description: processes the input sheet (a matrix) to calculate the cumulative number of 
instances per time step, 
inputs: sheetName 

Outputs: 


Sub CalcCumulative(ByVal sheetName As String) 

uumRows = CountRows(sheetName, 1) 

numCols = CountCols(sheetName, 1) 

Sheets(sheetName) .Select 

If sheetName = affiliatiouDescMatrix Then 
For i = 2 To uumRows 
cellSum = 0 
prevSum = 0 
curSum = 0 
For j = 6 To numCols 

prevSum= Sheets(sheetName).Cells(i, j - 1) 
curSum = Sheets(sheetName).Cells(i, j) 
cellSum = prevSum + curSum 
If cellSum > 0 Then 

Sheets(sheetName).Cells(i, j) = cellSum 
End If 
Next) 

Next i 

» 

' — For the authors matrix zeros must be put in when 

' there is no publication in an instance 

» 

Elself sheetName = "Affiliation authors" Or sheetName = datasheet Then 
For i = 2 To uumRows 
cellSum = 0 
prevSum = 0 
curSum = 0 
For j = 4 To numCols 

Ifj > 4 Then 'when not in first column 

prevSum = Sheets(sheetName).Cells(i, j - 1) 
curSum = Sheets(sheetName).Cells(i, j) 
cellSum = prevSum + curSum 
Sheets(sheetName).Cells(i, j) = cellSum 
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Else 'in first column 

If Not Sheets(sheetName).Cells(i, j) > 0 Then 
Sheets(sheetName).Cells(i, j) = 0 
End If 
End If 

Next j 
Next i 
Else 

For i = 2 To numRows 
cellSum = 0 
prevSum = 0 
curSum = 0 
For j = 5 To numCols 

prevSum= Sheets(sheetName).Cells(i, j - 1) 
curSum = Sheets(sheetName).Cells(i, j). Value 
If Not curSum = "" And Not prevSum = "" Then 
cellSum = prevSum + curSum 
Elself curSum = "" And Not prevSum = "" Then 
cellSum = prevSum 

Elself prevSum = "" And Not curSum = "" Then 
cellSum = curSum 
End If 

If cellSum > 0 Then 

Sheets(sheetName).Cells(i, j) = cellSum 
End If 
Next j 
Next i 
End If 

If sheetName = datasheet Or sheetName = "Affiliation authors" Then 'put count, mean, and stdev in 
each column 

For i = 4 To numCols 'put in the mean and std deviation for each time step 

Sheets(sheetName).Cells(numRows + 4, i).Formula = "=Sum(" & col(i) & "2:" & col(i) & 
numRows & ")" 'sum 

Sheets(sheetName).Cells(numRows + 1, i).Formula = "=Countif(" & col(i) & "2:" & col(i) & 
numRows & ", "">0"")" 

If Sheets(sheetName).Cells(numRows + 1, i) > 0 Then 

Sheets(sheetName).Cells(numRows + 2, i).Formula = "=AVERAGE(" & col(i) & "2:" & col(i) & 
numRows & ")" 

If Sheets(sheetName).Cells(numRows + 1, i) > 1 Then 'more than one so comput std deviation 
Sheets(sheetName).Cells(numRows + 3, i).Formula = "=STDEV(" & col(i) & "2:" & col(i) & 
numRows & ")" 

End If 
End If 
Next i 
Else 

'put in the sum of the columns 
For i = 4 To numCols 

Sheets(sheetName).Cells(numRows + 1, i).Formula = "=Sum(" & col(i) & "2:" & col(i) & 
numRows & ")" 

Next i 

End If 'sheetname = datasheet 
End Sub 
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Sub: FillBandTerms 
Author: Matt Behnke 
Created: 11/7/01 

Deseription: fdls in the term instanees for a band 
inputs: band name 

Outputs: 


Sub FillBandTerms(ByVal band As String) 
sheetName = "Term Dist " & band & "" 

Sheets.Add 

AetiveSheet.Name = sheetName 
currsheetName = sheetName 

eounter = 2 

affiliationDeseMatrix = "deseriptormatrixaffil" 
numRowsInBand = CountRows("Affihation_Cum_Dist_" & band, 1) 
numColumnsInTerms = CountCols(affdiationDeseMatrix, 1) 


Columns("C:C").ColumnWidth = 32.43 
'move the sheet 

Sheets(eurrsheetName).Move Before:=Sheets("" & band & " Stats") 

'header 

Sheets(sheetName).Cells(l, 1) = Sheets(affdiationDeseMatrix).Cells(l, 1) 

Sheets(sheetName).Cells(l, 2) = Sheets(affdiationDeseMatrix).Cells(l, 2) 

Sheets(sheetName).Cells(l, 3) = Sheets(affdiationDeseMatrix).Cells(l, 3) 

For i = 4 To numColumnsInTerms - 1 'eopy time interval header 
Sheets(sheetName).Cells(l, i) = Sheets("deseriptor_matrix_affil").Cells(l, i + 1) 

Next i 

'fill in the terms and instanees... 

For i = 2 To numRowsInBand 'eopy rows from datasheet into band 

affiliationName = Sheets("Affiliation_Cum_Dist_" & band).Cells(i, 3).Value 
For j = 2 To CountRows(affihationDeseMatrix, 1) 

If Sheets(affihationDeseMatrix).Cells(j, 4) = affiliationName Then 

rowInAffiliationDeseMatrix = j 

termName = Sheets(affihationDeseMatrix).Cells(rowInAffihationDeseMatrix, 3) 

'eheek to see if term exists already on band's list of terms 
termRowInBand = fmdStringInSheet(eurrsheetName, termName, 3) 

If termRowInBand > 0 Then 

Sheets(sheetName).Cells(termRowInBand, 2) = Sheets(sheetName).Cells(termRowInBand, 2) 
+ Sheets(affihationDeseMatrix).Cells(rowInAffiliationDeseMatrix, 2) 
eellSum = 0 
prevSum = 0 
eurSum = 0 
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If Sheets(affiliationDescMatrix).Cells(rowInAffiliationDescMatrix, 1). Value < 
Sheets(sheetName).Cells(termRowInBand, 1) Then 

Sheets(sheetName).Cells(termRowInBand, 1) = 
Sheets(affiliationDescMatrix).Cells(rowInAffdiationDescMatrix, 1). Value 
End If 

For z = 4 To numColumnsInTerms 'add the values for each time time to what aleady exists 
If z > 4 Then 'add cumulative sum of term instances (previous + current + numlnstances) 
'prevSum = sheets(sheetName).Cells(termRowInBand, z - 1) 

If Sheets(affdiationDescMatrix).Cells(rowInAffihationDescMatrix, z + 1) > 0 Then 
curSum = Sheets(sheetName).Cells(termRowInBand, z) + 
Sheets(affiliationDescMatrix).Cells(rowInAffdiationDescMatrix, z + 1) 

Else 

curSum = 0 
End If 

'cellSum = prevSum + curSum 
If curSum > 0 Then 

Sheets(sheetName).Cells(termRowInBand, z) = curSum 
End If 
Else 

If Sheets(affdiationDescMatrix).Cells(rowInAffihationDescMatrix, z + 1) > 0 Then 
Sheets(sheetName).Cells(termRowInBand, z) = 
Sheets(sheetName).Cells(termRowInBand, z) + 

Sheets(affiliationDescMatrix).Cells(rowInAffdiationDescMatrix, z + 1) 

End If 
End If 
Next z 

Else 'term not found 

Sheets(sheetName).Cells(counter, 1) = 
Sheets(affiliationDescMatrix).Cells(rowInAffdiationDescMatrix, 1) 

Sheets(sheetName).Cells(counter, 2) = 
Sheets(affiliationDescMatrix).Cells(rowInAffdiationDescMatrix, 2) 

Sheets(sheetName).Cells(counter, 3) = 
Sheets(affiliationDescMatrix).Cells(rowInAffdiationDescMatrix, 3) 

For z = 4 To numColumnsInTerms 

If z > 4 Then 'add cumulative sum of term instances (previous + current + numlnstances) 

'prevSum = sheets(sheetName).Cells(counter, z - 1) 
curSum = Sheets(sheetName).Cells(counter, z) + 
Sheets(affiliationDescMatrix).Cells(rowInAffihationDescMatrix, z + 1) 

'cellSum = prevSum + curSum 

If curSum > 0 Then 

Sheets(sheetName).Cells(counter, z) = curSum 
End If 
Else 

If Sheets(affdiationDescMatrix).Cells(rowInAffihationDescMatrix, z + 1) > 0 Then 
Sheets(sheetName).Cells(counter, z) = Sheets(sheetName).Cells(counter, z) + 
Sheets(affiliationDescMatrix).Cells(rowInAffdiationDescMatrix, z + 1) 

End If 'climates zeros 
End If'z = 4 
Next z 

counter = counter + 1 
End If 'if-found-else-not 
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End If' affiliation name matches 
Next j 
Next i 

numRows = CountRows(sheetName, 1) 
numCols = CountCols(sheetName, 1) 

For i = 4 To numCols 

Sheets(sheetName).Cells(numRows + 1, i).Formula = '-Sum(" & col(i) & "2:" & col(i) & 
numRows & ")" 

Next i 

'Call CalcCumulative (Active She et .Name) 

Call formatSheetForPrint 
End Sub 'fill band terms 


Sub: FillBandTermsEntropy 
Author: Matt Behnke 
Created: 11/17/01 

Description: computes the entropy of a band's terms. 

and the contribution of the band., 
inputs: band name 

Outputs: 


Sub FillBandTermsEntropy(ByVal band As String) 

Sheets.Add 

currsheetName = ActiveSheet.Name 

numRows = CountRows("Term_Dist_" & band & 1) 

numColumns = CountCols("Term_Dist_" & band & 1) 

'Sheets(Worksheets.Count). Select 

Columns("C:C").ColumnWidth = 32.43 

Sheets(currsheetName).Move Before:=Sheets("" & band & " Stats") 
numRowsWorld = CountRows(descriptorMatrixSheet, 1) 

'copy term distribution sheet for entropy 

Worksheets("Term_Dist_" & band & "").Range("Al:" & col(numColumns) & numRows).Copy 
Destination:=Worksheets(currsheetName).Range("Al") 


For i = 2 To numRows 

termName = Sheets(currsheetName).Cells(i, 3) 
termCount = Sheets(currsheetName).Cells(i, 2) 

termRowInWorldEntropy = findStringInSheet(worldEntropySheet, termName, 3) 
termRowInDescriptorMatrix = termRowInWorldEntropy 

For z = 4 To numColumns 

If Sheets(currsheetName).Cells(i, z).Value >= 1 Then 

termCountInBandInStep = Sheets(currsheetName).Cells(i, z) 
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sumlnstancesBand = Sheets("Term_Dist_" & band & "").Cells(numRows + 1, z) 
pTerm = termCountInBandInStep / sumlnstancesBand 
entropyTerm = -pTerm * (Log(pTenn) / Log(2)) 

Sheets(currsheetName).Cells(i, z) = entropyTem 

End If 
Next z 
Next i 

Sheets(currsheetName). Select 

Sheets(currsheetName).Cells(numRows +1,3) = "Sum" 

Sheets(currsheetName).Cells(numRows + 2, 3) = "Contribution" 
Sheets(currsheetName).Cells(numRows + 3,3) = "Difference" 

For i = 4 To numColumns 

Sheets(currsheetName).Cells(numRows + 1, i).Formula = "=Sum(" & col(i) & "2:" & col(i) & 
numRows & ")" 

numlnstancesWorld = Sheets(descriptorMatrixSheet).Cells(numRowsWorld + 1, i) 

'3/15/02 - change the num of instances of the world to = the total instances seen at the 
' last timestep. 

'this is WRONG... 3/15: 


'set numlnstances World = to the total num of instance over all the time steps and terms 
'numlnstancesWorld = Sheets(descriptorMatrixSheet).Cells(numRowsWorld + 1, numColumns) 


numlnstancesBand = Sheets("Term_Dist_" & band & "").Cells(numRows + 1, i) 

If numlnstancesBand > 0 Then 

ratio 1 = numlnstance s World / numlnstancesBand 
ratio2 = numlnstancesBand / numlnstances World 

entropySum = Sheets(currsheetName).Cells(numRows + 1, i) 
contributionOfBand = Abs(ratio2) * entropySum + (ratio2 * (Log(ratiol) / Log(2))) 
Sheets(currsheetName).Cells(numRows + 2, i) = contributionOfBand 
Sheets(currsheetName).Cells(numRows + 3, i) = Abs(entropySum - contributionOfBand) 
Else 

Sheets(currsheetName).Cells(numRows + 2, i) = 0 
Sheets(currsheetName).Cells(numRows + 3, i) = 0 
End If 
Next i 

ActiveSheet.Name = "Term Entropy Dist" & band & "" 

' Call formatSheetForPrint 

End Sub 'fdl band terms entropy 
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Sub: affiliationBandSummary 
Author: Matt Behnke 
Created: 11/30/01 

Deseription: ereates the summary sheet for the band.. 

shows step, num of reeors, authors, terms, entropy., 
inputs: band - the name of the band 
Outputs: none 


Sub affiliationBandSummary (By Val band As String) 

sheetName = "Affiliation Summary " & band 

numColumns = CountCols("Term_Entropy_Dist_" & band, 1) 
numRowsAffiliation = CountRows("Affiliation_Cum_Dist_" & band, 1) 
numRowsAuthor = CountRows("Aff_Author_Cum_Dist_" & band, 1) 
numRowsTermDist = CountRows("Term_Dist_" & band, 1) 
numRowsTermEntropy = CountRows("Term_Entropy_Dist_" & band, 1) 

Sheets.Add 

AetiveSheet.Name = sheetName 

currsheetName = AetiveSheet.Name 

Sheets(sheetName).Move Before:=Sheets(band & " Stats") 

Sheets(currsheetName). Seleet 
Aetive Sheet. StandardWidth = 13 

Sheets(sheetName).Cells(l, 1) = "" 

Sheets(sheetName).Cells(2, 1) = "" 

Sheets(sheetName).Cells(2, 3) = "Instanees (Previous + Current)" 

Sheets(sheetName).Cells(3, 1) = "Step" 

Sheets(sheetName).Cells(3, 2) = "interval" 

Sheets(sheetName).Cells(3, 3) = "Reeords" 

Sheets(sheetName).Cells(3, 4) = "Authors" 

Sheets(sheetName).Cells(3, 5) = "Terms" 

Sheets(sheetName).Cells(3, 6) = "Entropy" 

Sheets(sheetName).Cells(3, 7) = "Contribution" 

Sheets(sheetName).Cells(3, 8) = "Differenee" 

Sheets(sheetName).Cells(3, 9) = "Ree / Author" 

For i = 4 To numColumns 

Sheets(sheetName).Cells(i, 1) = i - 3 

Sheets(sheetName).Cells(i, 2) = Sheets("Term_Dist_" & band).Cells(l, i) 

Sheets(sheetName).Cells(i, 3).Value = '-SUM(Affdiation_Cum_Dist_" & band & "!" & eol(i) & 
"$2:" & eol(i) & "$" & numRowsAffdiation & ")" 

Sheets(sheetName).Cells(i, 4).Value = '-SUM(Aff_Author_Cum_Dist_" & band & "!" & eol(i) & 
"$2:" & eol(i) & "$" & numRowsAuthor & ")" 
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Sheets(sheetName).Cells(i, 5).Value = "=SUM(Tenn_Dist_" & band & "!" & col(i) & "$2:" & col(i) 

& "$" & numRowsTermDist & ")" 

Sheets(sheetName).Cells(i, 6) = Sheets("Term_Entropy_Dist_" & band).Cells(numRowsTermEntropy 

+ 1, i) 

Sheets(sheetName).Cells(i, 7) = Sheets("Term_Entropy_Dist_" & band).Cells(numRowsTermEntropy 
+ 2, i) 

Sheets(sheetName).Cells(i, 8) = Sheets("Term_Entropy_Dist_" & band).Cells(numRowsTermEntropy 
+ 3, i) 

If Sheets(sheetName).Cells(i, 4) > 0 Then 

Sheets(sheetName).Cells(i, 9) = Sheets(sheetName).Cells(i, 3) / Sheets(sheetName).Cells(i, 4) 

End If 
Next i 


End Sub 


Sub: affiliationSummary 
Author: Matt Behnke 
Created: 11/30/01 
added stuff: 2/1/02 

Description: creates the world affilation summary sheet.. 

this is the first part., the second part puts in the temp poly and the pressure equations 

after fdl months has been run on the sheet. 

inputs: none 
Outputs: none 


Sub affiliationSummaryO 

numColumns = CountCols(worldEntropy Sheet, 1) 
numRowsAffdiation = CountRows (data Sheet, 1) 
numRowsAuthor = CountRows("Affiliation_authors", 1) 
numRowsTermDist = CountRows(descriptorMatrixSheet, 1) 
numRowsTermEntropy = CountRows(worldEntropySheet, 1) 

Sheets. Add After:=Worksheets(Worksheets.Count) 

' Sheets(Worksheets.Count). Select 
sheetName = "AffiliationSummary" 

ActiveSheet.Name = sheetName 
currsheetName = ActiveSheet.Name 

Sheets(currsheetName).Move After:=Sheets(Sheets.Count) 

Sheets(currsheetName). Select 
Active Sheet. StandardWidth = 13 


Sheets(sheetName).Cells(l, 1) = "" 

Sheets(sheetName).Cells(2, 1) = "" 

Sheets(sheetName).Cells(2, 3) = "Instances (Previous + Current)" 

Sheets(sheetName).Cells(3, 1) = "Step" 
Sheets(sheetName).Cells(3, 2) = "interval" 
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Sheets(sheetName).Cells(3, 3) = "Records" 

Sheets(sheetName).Cells(3, 4) = "Authors (v_X)" 

Sheets(sheetName).Cells(3, 5) = "Rec / Author" 

Sheets(sheetName).Cells(3, 6) = "Terms X" 

Sheets(sheetName).Cells(3, 7) = "Terms Y" 

Sheets(sheetName).Cells(3, 8) = "S(X)" 

Sheets(sheetName).Cells(3, 9) = "S(Y)" 

Sheets(sheetName).Cells(3, 10) = "S(X,Y)" 

Sheets(sheetName).Cells(3, 11) = "S(X;Y)" 

Sheets(sheetName).Cells(3, 12) = "deltanx" 

Sheets(sheetName).Cells(3, 13) = "deltasx" 

Sheets(sheetName).Cells(3, 14) = "T_X Saboe Degrees" 

Sheets(sheetName).Cells(3, 15) = "deltany" 

Sheets(sheetName).Cells(3, 16) = "deltasy" 

Sheets(sheetName).Cells(3, 17) = "vYnodes" 

Sheets(sheetName).Cells(3, 18) = "pressure n per node" 

For i = 4 To numColumns 

Sheets(sheetName).Cells(i, 1) = i - 3 
Sheets(sheetName).Cells(i, 2) = Sheets(dataSheet).Cells(l, i) 

If i > 4 Then 

Sheets(sheetName).Cells(i, 3).Value = "=SUM("' & datasheet & & col(i) & "$2:" & col(i) & 

"$" & uumRowsAffdiation & ")" '+ C" & i - 1 
Else 

Sheets(sheetName).Cells(i, 3).Value = "=SUM("' & datasheet & & col(i) & "$2:" & col(i) & 

"$" & uumRowsAffiliation & ")" 

End If 

If i > 4 Then 

Sheets(sheetName).Cells(i, 4).Value = '-SUM(Affiliation_authors!" & col(i) & "$2:" & col(i) & 
"$" & numRowsAuthor & ")" ' + D" & i - 1 
Else 

Sheets(sheetName).Cells(i, 4).Value = '-SUM(Affiliation_authors!" & col(i) & "$2:" & col(i) & 
"$" & numRowsAuthor & ")" 

End If 

Sheets(sheetName).Cells(i, 5) = Sheets(sheetName).Cells(i, 3) / Sheets(sheetName).Cells(i, 4) 
Sheets(sheetName).Cells(i, 6).Value = "=SUM(" & descriptorMatrixSheet & "!" & col(i) & "$2:" & 
col(i) & "$" & numRowsTermDist & ")" 

Sheets(sheetName).Cells(i, 7).Value = "=SUM(" & descriptorMatrixSheetY & "!" & col(i) & "$2:" & 
col(i) & "$" & numRowsTermDist & ")" 

Sheets(sheetName).Cells(i, 8) = Sheets(worldEntropySheet).Cells(numRowsTermEntropy + 1, i) 
Sheets(sheetName).Cells(i, 9) = Sheets(worldEntropySheetY).Cells(numRowsTermEntropy + 1, i) 
Sheets(sheetName).Cells(i, 10) = Sheets(worldEntropySheet).Cells(numRowsTermEntropy + 1, 
numColumns) 

Sheets(sheetName).Cells(i, 11) = & col(8) & i & "+" & col(9) & i & & col(lO) & i 'Cells(i, 8) 

+ sheets(sheetName).Cells(i, 9) - sheets(sheetName).Cells(i, 10) 

If i > 4 Then 

Sheets(sheetName).Cells(i, 12) = '& col(6) & i & & col(6) & i - 1 'cells(i, 6) - cells (i-1,6) 

deltany 

Sheets(sheetName).Cells(i, 13) = & col(9) & i - 1 & & col(9) & i 'cells(i, 9) - cells (i-1,9) 

deltasx 

Sheets(sheetName).Cells(i, 14) = & col(12) & i & "/" & col(13) & i 'cells(i, 12) / cells(i, 13) 

T X 
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Sheets(sheetName).Cells(i, 15) = & col(7) & i & & col(7) & i - 1 'cells(i, 7) - cells (i-1,7) 

deltany 

Sheets(sheetName).Cells(i, 16) = & col(8) & i & & col(8) & i - 1 'cells(i, 8) - cells(i-l, 8) 

deltasy 

End If 

Sheets(sheetName).Cells(i, 17) = Sheets("Affdiation_authors").Cells(numRowsAuthor + 1, 
numColumns) - Sheets(sheetName).Cells(i, 4) 

Sheets(sheetName).Cells(i, 18) = '& col(6) & i & "/" & col(4) & i 'cells(i,6) / cells(i,4) terms X / 
author X 
Next i 

Sheets(sheetName).Cells(4, 3).Select 'freeze panes 
ActiveWindow.FreezePanes = True 


Call fillMonthsRow("Affiliation_Summary", 4) 

' Call fillMonthsRow("Affiliation_Summary", 4) 

'Call CopyInteractingSystemsGraphs(ActiveSheet.Name, numColumns) 
numRows = CountRows("Affiliation_Summary", 1) 

Call FillInMissingData("Affdiation_Summary", 6, 4, numRows) 

Call FillInMissingData("Affiliation_Summary", 9, 4, numRows) 

End Sub 


Sub: affibationSummaryPart2 
Author: Matt Behnke 
Created: 2/1/02 

Description: after fillmonths has been ran this procedure copies the appropriate graphs 
interactive systems graphs and temp / pressure graphs 
uses the trendline equations from the system graph to calculate 
temp_polynomial and 
the pressure equation 
inputs: none 
Outputs: none 


Sub affihationSummaryPart2() 

source = "AffiliationSummary" 
numRows = CountRows(source, 1) 

Sheets(source).Cells(3, 19) = "S(X) calculated" 
Sheets(source).Cells(3, 20) = "S(Y) calculated" 
Sheets(source).Cells(3, 21) = "delta S(X) calculated" 
Sheets(source).Cells(3, 22) = "n(X) calculated" 
Sheets(source).Cells(3, 23) = "delta n x calculated" 
Sheets(source).Cells(3, 24) = "T_X Saboe Deg. Polynomial" 

Call CopyInteractingSystemsGraphs(" World") 

trendlineA = Sheets(source).Cells(l, 19) 
sx_a = firstPartTrendEq(trendlineA) 
sx_b = secondPartTrendEq(trendhneA) 

'sx_a = firstPartPolyTrendEq(trendhneA) 

'sx_b = secondPartPolyTrendEq(trendhneA) 
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'sx_c = thirdPartPolyTrendEq(trendlineA) 

trendlineB = Sheets(source).Cells(l, 20) 
sy_a = firstPartPolyTrendEq(trendlineB) 
sy_b = secondPartPolyTrendEq(trendlineB) 
sy_c = thirdPartPolyTrendEq(trendlineB) 

trendline nX = Sheets(source).Cells(l, 22) 
nx_a = firstPartTrendEq(trendline_nX) 
nx_b = secondPartTrendEq(trendline_nX) 

For i = 4 To numRows 
k= Sheets(source).Cells(i, 1) 

'sX & sY calculated 

Sheets(source).Cells(i, 19) = sx_a * k ^ sx_b 'power equation of entropy 
'Sheets(source).Cells(i, 19) = sx_a * k 2 + sx_b * k + sx_c 
Sheets(source).Cells(i, 20) = sy_a * k ^ 2 + sy_b * k + sy_c 
'nX calculated 

Sheets(source).Cells(i, 22) = nx_a * (k ^ nx_b) 


If Sheets(source).Cells(i - 1, 6).Font.ColorIndex = 3 Then 
'find the first row of the same value 
X = i - 1 

While Sheets(source).Cells(x, 6).Font.ColorIndex = 3 
X = X - 1 
Wend 

previous SY = Sheets(source).Cells(x, 9) 'S(Y) from previous step 
previous nX = Sheets(source).Cells(x, 6) 'number of terms in previous step 
Else 

previous SY = Sheets(source).Cells(i - 1, 9) 'S(Y) from previous step 
previous nX = Sheets(source).Cells(i -1,6) 'number of terms in previous step 
End If 

If i > 4 Then 

'check to see if current S(Y) or current n(X) (num terms) is the same as previous 
'if so then place the value of the calculated S(Y) or n(X) into that spot of similarity 
'mark the spot in red where a calculated value has been substituted. 

If Sheets(source).Cells(i, 9) = previous SY Then 

Sheets(source).Cells(i, 9) = & col(20) & i 'equals calc'ed value of S(Y) 

Sheets(source).Cells(i, 9).Font.ColorIndex = 3 
End If 

If Sheets(source).Cells(i, 6) = previous nX Then 

Sheets(source).Cells(i, 6) = & col(22) & i 'equals calc'ed value of n(X) 

Sheets(source).Cells(i, 6).Font.ColorIndex = 3 
End If 

'delta S(X)_calculated 

Sheets(source).Cells(i, 21) = & col(19) & i & & col(19) & i - 1 'cells(i,19) - cells(i-l,19) 

'delta n(X)_calculated 

Sheets(source).Cells(i, 23) = & col(22) & i & & col(22) & i - 1 'cells(i,22) - cells(i-l,22) 

't(x)_poly = n(X)/S(X) 

Sheets(source).Cells(i, 24) = & col(23) & i & "/" & col(21) & i 'cells(i,23) / cells(i,21) 

End If 
Next i 
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End Sub 


' Sub: affiliationSummaryParts 
' Author: Matt Behnke 
' Created: 2/4/02 

' Deseription: eopies the temp / pressure graphs uses trendline equations of temp_poly and pressure to get 
the 

' the pressure equation 

' inputs: none 
' Outputs: none 


Sub affiliationSummaryParts 0 

souree = "AffiliationSummary" 
uumRows = CountRows(souree, 1) 

Sheets(souree).Cells(S, 25) = "Press f(T)" 
Sheets(souree).Cells(l, 24) = "m_P" 
Sheets(souree).Cells(2, 24) = "b_P" 
Sheets(souree).Cells(l, 26) = "m_T" 
Sheets(souree).Cells(2, 26) = "b_T" 

Call Copy!empPressGraphs(" World") 

trendline Tpoly = Sheets(souree).Cells(l, 27) 
m_t = firstPartTrendEq(trendhne_Tpoly) 
b_t = seeondPartLinearTrendEq(trendhne_Tpoly) 
Sheets(souree).Cells(l, 27) = m_t 
Sheets(souree).Cells(2, 27) = b_t 

trendline Press = Sheets(souree).Cells(l, 25) 
m_p = firstPartTrendEq(trendline_Press) 
b_p = seeondPartLinearTrendEq(trendline_Press) 
Sheets(souree).Cells(l, 25) = m_p 
Sheets(souree).Cells(2, 25) = b_p 


For i = 5 To uumRows 

Tx_poly = Sheets(souree).Cells(i, 24) 

Sheets(souree).Cells(i, 25) = b_p + (m_p / m_t) * (Tx_poly - b_t) 

Next i 

'eopy third Graph World Press vs Temp Saboe 
Applieation.DisplayAlerts = False 

Windows("AffiliationMaero.xls").Aetivate 

Sheets("World_Press_vs_Temp_Saboe").Seleet 

Sheets("World_Press_vs_Temp_Saboe").Copy After:=Workbooks(eurrFilename).Sheets(souree) 
AetiveChart.SeriesColleetion(l).Seleet 

'Pressure per node 

AetiveChart.SeriesColleetion(l).Values = & souree & "!R" & 5 & "C25:R" & uumRows & "C25" 

AetiveChart.SeriesColleetion(l).XValues = & souree & "!R" & 5 & "C24:R" & uumRows & "C24" 

'T(x) poly: 

AetiveChart.SeriesColleetion(2).Values = & souree & "!R" & 5 & "C18:R" & uumRows & "C18" 
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ActiveChart.SeriesCollection(2).XValues = & source & "!R" & 5 & "C24:R" & numRows & "C24" 

Application. Display Alerts = True 
End Sub 


Sub: copyTempPressGraphs 
Author: Matt Behnke 
Created: 2/4/02 

Description: copies the interacting systems graphs from the affdiation macro workbook, 
inputs: band name 

Outputs: 


Sub CopyTempPressGraphs(ByVal band As String) 

Application. Display Alerts = False 

If band = "World" Then 

source = "AffiliationSummary" 

Else 

source = "Affiliation Summary " & band 
End If 

numRows = CountRows(source, 1) 

'GRAPH ONE XY Temp 

Windows("AffiliationMacro.xls").Activate 

Sheets("XY_Temp").Select 

Sheets("XY_Temp").Copy After:=Workbooks(currFilename).Sheets(source) 
ActiveChart.SeriesCollection(l). Select 

j =4 

While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets(source).Cells(j, 3).Value 
If Not (i > 0) Then 

j=j + l 

End If 
Wend 

'X-Category 
'msgs per node 

ActiveChart.SeriesCollection(l).Values = & source & "!R" & j & "C18:R" & numRows & "C18" 
ActiveChart.SeriesCollection(l).XValues = & source & "!R" & j & "C1:R" & numRows & "Cl" 

't(x) 

ActiveChart.SeriesCollection(2).Values = & source & "!R" & j & "C14:R" & numRows & "C14" 
ActiveChart.SeriesCollection(2).XValues = & source & "!R" & j & "C1:R" & numRows & "Cl" 

't(x) poly 

ActiveChart.SeriesCollection(3).Values = & source & "!R" & j & "C24:R" & numRows & "C24" 

ActiveChart.SeriesCollection(3).XValues = & source & "!R" & j & "C18:R" & numRows & "C18" 

With ActiveChart.SeriesCollection(2).Trendlines(l) 

'put trendline equation onto stats sheet for T(X)_poly 
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•DisplayEquation = True 
•DisplayRSquared = True 

End With 

With ActiveChart.SeriesCollection(3).Trendlines(l) 

'put trendline equation onto stats sheet for T(X)_poly 
•DisplayEquation = True 
•DisplayRSquared = True 

Worksheets(souree)^Cells(l, 27)^Value = •DataLabehText 
End With 

'copy second graph XY Press 
Windows("Affiliation]Vtacro^xls")^Activate 
Sheets("XY_Press")^Select 

Sheets("XY_Press")^Copy After~Workbooks(currFilename)^Sheets(source) 
ActiveChart^SeriesCollection(l)^ Select 
'Pressure per node 

ActiveChart^SeriesCollection(l)^Values = & source & "!R" & j & "C18:R" & numRows & "C18" 

ActiveChart^SeriesCollection(l>XValues = & source & "!R" & j & "C1:R" & numRows & "Cl" 

'T(x) poly: 

ActiveChart^SeriesCollection(2)^ Values = & source & "!R" & j & "C24:R" & numRows & "C24" 

ActiveChart^SeriesCollection(2ixValues = & source & "!R" & j & "C18:R" & numRows & "C18" 

With ActiveChart^SeriesCollection(2)^Trendlines(l) 

'put trendline equation onto stats sheet for pressure fit 
•DisplayEquation = True 
•DisplayRSquared = True 
End With 

With ActiveChart^ SeriesCollection( 1 )^Trendlines( 1) 

'put trendline equation onto stats sheet for pressure fit 
•DisplayEquation = True 
•DisplayRSquared = True 

Worksheets(source)^Cells(l, 25)^Value = •DataLabehText 
End With 

Application^ Display Alerts = Trae 
End Sub 'copy temp/press graphs 


Sub: copyInteractingSystemsGraphs 
Author: Matt Behnke 
Created: 2/1/02 

Description: copies the interacting systems graphs from the affdiation macro workbook^ 
inputs: band name 

Outputs: 


Sub CopyInteractingSystemsGraphs(ByVal band As String) 
Application^ Display Alerts = False 
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If band = "World" Then 

source = "AffiliationSummary" 

Else 

source = "Affiliation Summary " & band 
End If 

uumRows = CountRows(source, 1) 

'GRAPH ONE S_2Interacting Systems 

Windows("AffiliationMacro.xls").Activate 
Sheets(" S_2Interacting systems").Select 

Sheets("S_2Interacting systems").Copy Afler~Workbooks(currFilename).Sheets(source) 
ActiveChart.SeriesCollection(l). Select 

j ^ 4 

While Not (i > 0#) 'if the first time step's mean is zero find the step that doesnt have 0 
i = Sheets(source).Cells(j, 3).Value 
If Not (i > 0) Then 

j=j + l 

End If 
Wend 

'X-Category 

'S(Y) 

ActiveChart.SeriesCollection(l).Values = & source & "!R" & j & "C9:R" & uumRows & "C9" 

ActiveChart.SeriesCollection(l).XValues = & source & "!R" & j & "C6:R" & uumRows & "C6" 

'S(X): 

ActiveChart.SeriesCollection(2).Values = & source & "!R" & j & "C8:R" & numRows & "C8" 

ActiveChart.SeriesCollection(2).XValues = & source & "!R" & j & "C6:R" & numRows & "C6" 

'S(X,Y) 

ActiveChart.SeriesCollection(3).Values = & source & "!R" & j & "C10:R" & numRows & "CIO" 

ActiveChart.SeriesCollection(3).XValues = & source & "!R" & j & "C6:R" & numRows & "C6" 

'S(X;Y) 

ActiveChart.SeriesCollection(4).Values = & source & "!R" & j & "C11:R" & numRows & "Cll" 

ActiveChart.SeriesCollection(4).XValues = & source & "!R" & j & "C6:R" & numRows & "C6" 


With ActiveChart. SeriesCollection( 1 ).Trendlines( 1) 

'put trendline equation onto stats sheet for S(y) 
•DisplayEquation = True 
•DisplayRSquared = Tme 

Worksheets(source).Cells(l, 20).Value = .DataLabel.Text 
End With 

With ActiveChart.SeriesCollection(2).Trendhnes(l) 

'put trendline equation onto stats sheet for S(x) 
•DisplayEquation = True 
•DisplayRSquared = True 

Worksheets(source).Cells(l, 19).Value = .DataLabel.Text 
End With 

'copy second graph World_(X)Temp_S_2 
Windows("AffiliationMacro.xls").Activate 
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Sheets("World_(X)Temp_S_2").Select 

Sheets("World_(X)Temp_S_2").Copy After~Workbooks(currFilename).Sheets(source) 
ActiveChart.SeriesCollectioii(l). Select 
'S(X) vs T_X 

ActiveChart.SeriesCollectioii(l).Values = & source & "!R" & j & "C8:R" & uumRows & "C8" 

ActiveCliart.SeriesCollectiou(lixValues = & source & "!R" & j & "C14:R" & uumRows & "C14" 

'S(X;Y): 

ActiveChart.SeriesCollectiou(2).Values = & source & "!R" & j & "C11:R" & uumRows & "Cll" 

ActiveCliart.SeriesCollectiou(2).XValues = '& source & "!R" & j & "C6:R" & uumRows & "C6" 

With ActiveChart. SeriesCollectiou( 1 ).Treudliues( 1) 

'put treudliue equatiou outo stats sheet for S(y) 

•DisplayEquatiou = True 
•DisplayRSquared = Tme 

Eud With 

With ActiveChart.SeriesCollectiou(2).Treudhues(l) 

'put treudliue equatiou outo stats sheet for S(x) 

•DisplayEquatiou = True 
•DisplayRSquared = Trae 

Eud With 

'copy third Graph u_Msg_2Iuteractiug systems 
Wiudows("AffdiatiouMacro.xls").Activate 
Sheets("u_]Vtsg_2Iuteractiug systems").Select 

Sheets("uJVlsg_2Iuteractiug systems").Copy After~Workbooks(currFileuame).Sheets(source) 
ActiveChart.SeriesCollectiou(l). Select 

'u_X 

ActiveChart.SeriesCollectiou(l).Values = & source & "!R" & j & "C6:R" & uumRows & "C6" 

With ActiveChart.SeriesCollectiou( 1 ).Treudliues( 1) 

'put treudliue equatiou outo stats sheet for S(x) 

•DisplayEquatiou = True 
•DisplayRSquared = Tme 

Worksheets(source).Cells(l, 22).Value = .DataLabel.Text 
Eud With 

'u_Y 

ActiveChart.SeriesCollectiou(2).Values = "=Affiliatiou_Summary!R" & j & "C7:R" & uumRows & 
"C7" 

Applicatiou. Display Alerts = Trae 
Eud Sub 'copy iuteractiug systems graphs 


Sub: eutropySummary 
Author: Matt Behuke 
Created: 11/19/01 

Descriptiou: creates the eutropy summary sheet., for the world aud all the bauds, 
shows the local aud coutributiou eutropies of each baud 
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' inputs: none 
' Outputs: none 


Sub entropySummaryO 

sheetName = "Entropy Summary" 

numColumnsIuTerms = CountCols(worldEntropySheet, 1) 

If testBandA > 0 Then 

uumRowsAband = CountRows("Term_Entropy_Dist_A_Band", 1) 
End If 

If testBandB > 0 Then 

uumRowsBband = CountRows("Term_Entropy_Dist_B_Band", I) 
End If 

If testBandC > 0 Then 

numRowsCband = CountRows("Term_Entropy_Dist_C_Band", I) 
End If 

If testBandD > 0 Then 

numRowsDband = CountRows("Term_Entropy_Dist_D_Band", I) 
End If 

numRowsWorld = CountRows(worldEntropySheet, I) 

Sheets. Add After:=Worksheets(Worksheets.Count) 

' Sheets(Worksheets.Count). Select 
ActiveSheet.Name = sheetName 
currsheetName = sheetName 

Sheets(currsheetName).Move After:=Sheets("D_Band_Stats") 

Sheets(currsheetName).Select 
Active Sheet. StandardWidth = 13 

Sheets(sheetName).Cells(l, 1) = "" 

Sheets(sheetName).Cells(2, 1) = "" 

Sheets(sheetName).Cells(3, 1) = "Step" 

Sheets(sheetName).Cells(3, 2) = "interval" 

Sheets(sheetName).Cells(3, 3) = "A Band Entropy" 
Sheets(sheetName).Cells(3, 4) = "A Band Contribution" 
Sheets(sheetName).Cells(3, 5) = "A Band Difference" 

Sheets(sheetName).Cells(3, 6) = "B Band Entropy" 
Sheets(sheetName).Cells(3, 7) = "B Band Contribution" 
Sheets(sheetName).Cells(3, 8) = "B Band Difference" 

Sheets(sheetName).Cells(3, 9) = "C Band Entropy" 
Sheets(sheetName).Cells(3, 10) = "CBand Contribution" 
Sheets(sheetName).Cells(3, 11) = "CBand Difference" 

Sheets(sheetName).Cells(3, 12) = "DBand Entropy" 
Sheets(sheetName).Cells(3, 13) = "DBand Contribution" 
Sheets(sheetName).Cells(3, 14) = "DBand Difference" 
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Sheets(sheetName).Cells(3, 15) = "Sum Band Entropy" 
Sheets(sheetName).Cells(3, 16) = "Sum Band Contribution" 
Sheets(sheetName).Cells(3, 17) = "World Entropy" 
Sheets(sheetName).Cells(3, 18) = "Diff World & Contrib" 


For i = 4 To numColumnsInTerms 
Sheets(sheetName).Cells(i, 1) = i - 3 

Sheets(sheetName).Cells(i, 2) = Sheets("Term_Entropy_Dist_D_Band").Cells(l, i) 
If testBandA > 0 Then 


Sheets(sheetName).Cells(i, 3) = Sheets("Term_Entropy_Dist_A_Band").Cells(numRowsAband + 1, 

i) 

Sheets(sheetName).Cells(i, 4) = Sheets("Term_Entropy_Dist_A_Band").Cells(numRowsAband + 2, 

i) 

Sheets(sheetName).Cells(i, 5) = Sheets("Term_Entropy_Dist_A_Band").Cells(numRowsAband + 3, 

i) 

End If 

If testBandB > 0 Then 

Sheets(sheetName).Cells(i, 6) = Sheets("Term_Entropy_Dist_B_Band").Cells(numRowsBband + I, 

i) 

Sheets(sheetName).Cells(i, 7) = Sheets("Term_Entropy_Dist_B_Band").Cells(numRowsBband + 2, 

i) 

Sheets(sheetName).Cells(i, 8) = Sheets("Term_Entropy_Dist_B_Band").Cells(numRowsBband + 3, 

i) 

End If 

If testBandC > 0 Then 

Sheets(sheetName).Cells(i, 9) = Sheets("Term_Entropy_Dist_C_Band").Cells(numRowsCband + I, 

i) 

Sheets(sheetName).Cells(i, 10) = Sheets("Term_Entropy_Dist_C_Band").Cells(numRowsCband + 
2, i) 

Sheets(sheetName).Cells(i, II) = Sheets("Term_Entropy_Dist_C_Band").Cells(numRowsCband + 

3.1) 

End If 

If testBandD > 0 Then 

Sheets(sheetName).Cells(i, 12) = Sheets("Term_Entropy_Dist_D_Band").Cells(numRowsDband + 

1.1) 

Sheets(sheetName).Cells(i, 13) = Sheets("Term_Entropy_Dist_D_Band").Cells(numRowsDband + 
2, i) 

Sheets(sheetName).Cells(i, 14) = Sheets("Term_Entropy_Dist_D_Band").Cells(numRowsDband + 

3.1) 

End If 

Sheets(sheetName).Cells(i, 15) = Sheets(sheetName).Cells(i, 3) + Sheets(sheetName).Cells(i, 6) + 
Sheets(sheetName).Cells(i, 9) + Sheets(sheetName).Cells(i, 12) 

Sheets(sheetName).Cells(i, 16) = Sheets(sheetName).Cells(i, 4) + Sheets(sheetName).Cells(i, 7) + 
Sheets(sheetName).Cells(i, 10) + Sheets(sheetName).Cells(i, 13) 

Sheets(sheetName).Cells(i, 17) = Sheets(worIdEntropySheet).Cells(numRowsWorld + I, i) 
Sheets(sheetName).Cells(i, 18) = Abs(Cells(i, 17) - Sheets(sheetName).Cells(i, 16)) 

Next i 
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End Sub 


Sub: instances Summary 
Author: Matt Behnke 
Created: 4/10/02 

Description: creates a summary sheet of the number of instances in each band over time 
this is used to compare the number of instances in each band to the instances 
for the world. 

IF they are different then the source data for the band instances is not consistent 
with the source data for the world instances, (descriptor matrix affil vs. descriptor data x) 
inputs: none 
Outputs: none 


Sub instances Summary 0 

'testBandA = 1 
'testBandB = 1 
'testBandC = 1 
'testBandD = 1 

sheetName = "Instances Summary" 

numColumnsIuTerms = CountCols(descriptorMatrixSheet, 1) 

If testBandA > 0 Then 

numRowsAband = CountRows("Term_Dist_A_Band", 1) 

End If 

If testBandB > 0 Then 

numRowsBband = CountRows("Term_Dist_B_Band", 1) 

End If 

If testBandC > 0 Then 

numRowsCband = CountRows("Term_Dist_C_Band", 1) 

End If 

If testBandD > 0 Then 

numRowsDband = CountRows("Term_Dist_D_Band", 1) 

End If 

numRowsWorld = CountRows(descriptorMatrixSheet, 1) 

Sheets. Add After:=Worksheets(Worksheets.Count) 

' Sheets(Worksheets.Count). Select 
ActiveSheet.Name = sheetName 
currsheetName = sheetName 

Sheets(currsheetName).Move After:=Sheets(descriptorMatrixSheet) 

Sheets(currsheetName).Select 
Active Sheet. StandardWidth = 13 

Sheets(sheetName).Cells(l, 1) = "" 

Sheets(sheetName).Cells(2, 1) = "" 


Sheets(sheetName).Cells(3, 1) = "Step' 
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Sheets(sheetName).Cells(3, 2) = "interval' 


Sheets(sheetName).Cells(3, 3) = "A Band Instances" 
Sheets(sheetName).Cells(3, 4) = "B Band Instances" 
Sheets(sheetName).Cells(3, 5) = "C Band Instances" 
Slieets(slieetName).Cells(3, 6) = "D Band Entropy" 

Slieets(sheetName).Cells(3, 7) = "Sum Band Instances" 

Sheets(sheetName).Cells(3, 8) = "World Entropy Instances" 
Slieets(slieetName).Cells(3, 9) = "Diff World & Band" 


For i = 4 To numColumnsInTerms 
Sheets(slieetName).Cells(i, 1) = i - 3 

Sheets(slieetName).Cells(i, 2) = Slieets("Term_Entropy_Dist_D_Band").Cells(l, i) 

If testBandA > 0 Then 

Sheets(sheetName).Cells(i, 3) = Sheets("Term_Dist_A_Band").Cells(numRowsAband + 1, i) 
End If 

If testBandB > 0 Then 

Sheets(sheetName).Cells(i, 4) = Sheets("Term_Dist_B_Band").Cells(numRowsBband + 1, i) 
End If 

If testBandC > 0 Then 

Sheets(sheetName).Cells(i, 5) = Sheets("Term_Dist_C_Band").Cells(numRowsCband + 1, i) 
End If 

If testBandD > 0 Then 

Sheets(sheetName).Cells(i, 6) = Sheets("Term_Dist_D_Band").Cells(numRowsDband + 1, i) 
End If 

Sheets(sheetName).Cells(i, 7) = Sheets(sheetName).Cells(i, 3) + Sheets(sheetName).Cells(i, 4) + 
Sheets(sheetName).Cells(i, 5) + Sheets(sheetName).Cells(i, 6) 

Sheets(sheetName).Cells(i, 8) = Sheets(descriptorMatrixSheet).Cells(numRowsWorld + 1, i) 
Sheets(sheetName).Cells(i, 9) = Abs(Cells(i, 8) - Sheets(sheetName).Cells(i, 7)) 

Next i 


'testBandA = 0 
'testBandB = 0 
'testBandC = 0 
'testBandD = 0 

End Sub 


Function: fmdStringRowInSheet 
Author: Matt Behnke 
Created: 2/28/02 

Description: determines the row of the string in the given sheet, uses find function 
inputs: matrixSheet, termName (descriptor), column letter of term in matrixSheet 
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' Outputs: the row number 


Function findStringInSheet(ByVal matrixSheet As String, ByVal termName As String, ByVal column As 
String) As String 

With Worksheets(matrixSheet).Range(column & & column) 

Set C = .Find(termName, LookIn:=xlValues) 

If Not C Is Nothing Then 
firstAddress = C.Address 
temp = Sheets(l).Cells(l, 1) 

Sheets(l).Cells(l, 1) = firstAddress 

theRow = Sheets(l).Cells(l, l).Characters(4, 5).Text 

Sheets(l).Cells(l, l) = temp 


findStringInSheet = theRow 
Else 

fmdStringInSheet = 0 
End If 
End With 


End Function 'funciton 


Function: fmdStringRowInSheet ****OBSOLETE*** Slow 
Author: Matt Behnke 
Created: 11/16/01 

Description: determines the row of the string in the given sheet 
inputs: sheetname, descriptor, column of desc in datasheet 
Outputs: row number where the value is found 


Function fmdStringRowInSheet(ByVal matrixSheet As String, ByVal termName As String, ByVal 
columnNum As Integer) As Integer 

foundAt = 0 

numRows = CountRows(matrixSheet, 1) 

For i = 2 To numRows 'assume column header 
If Cells(i, columnNum).Value = termName Then 
foundAt = i 
found = True 
Exit For 
End If 
Next i 

If found = True Then 

findStringRowInSheet = foundAt 
Else 

findStringRowInSheet = 0 
End If 

End Function 


' Subroutine: formatSheetForPrint 
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' Author: Matt Behnke 
' Created: 9/19/01 

' Deseription: formats the sheet to fit on one page wide (legal size paper) 

' adds header and footer to each sheet and sets orientation to landscape 

' inputs: none 
' Outputs: none 


Sub formatSheetForPrint() 

'column heading (R11.3) 

With ActiveSheet.PageSetup 
' .PrintTitleRows = "$l:$l" 

' .PrintTitleColumns = "" 

End With 

' ActiveSheet.PageSetup.PrintArea = "$A$1:$Y$203" 

With ActiveSheet.PageSetup 
' .LeftHeader^"" 

' .CenterHeader = "&A in &F" '(Rl 1.4) 

' .RightFleader = 

' .LeftFooter = "&D" '(R11.5) 

' .CenterFooter = "Page &P of &N" 

' .RightFooter = 

' .LeftMargin = Application.InchesToPoints(0.75) 

' .RightMargin = Apphcation.InchesToPoints(0.75) 
'.TopMargin = Application.InchesToPoints(l) 

' .BottomMargin = Apphcation.InchesToPoints(l) 

' .FleaderMargin = Apphcation.InchesToPoints(0.5) 

' .FooterMargin = Apphcation.InchesToPoints(0.5) 

' .PrintFleadings = False 

' .PrintGridlines = True 

' .PrintComments = xlPrintNoComments 
' .CenterFlorizontally = False 
' .CenterVertically = False 
'.Orientation = xlLandscape 
' .Draft = False 

' .PaperSize = xlPaperLetter 
' .FirstPageNumber = xlAutomatic 
' .Order = xlDownThenOver 
' .BlackAndWhite = False 
' .Zoom = False 
' .FitToPagesWide = 1 
'.FitToPagesTall = 99 
End With 

End Sub 'format sheet for print 

Sub rSquaredtest() 

Call rSquaredSheet("World") 

Call rSquaredSheet("A_Band") 

Call rSquaredSheet("B_Band") 

Call rSquaredSheet("C_Band") 

Call rSquaredSheet("D_Band") 

End Sub 


'(R11.6) 

'(Rll.l) 

'(R11.2) 


' rSquaredSheet 
' author: Matt Behnke 
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created 1/3/02 

creates a new sheet that stores the R*R values of the graphs: 

number of publications over time and cumulative entropy over time 
Uses the information stored in the affiliation Summary sheets.. 


Sub rSquaredSheet(ByVal band As String) 

Sheets. Add After:=Worksheets(Worksheets.Count) 

'Sheets(Worksheets.Count). Select 

currSheet = ActiveSheet.Name 
Sheets(currSheet). Select 

'set the source of the data 
If band = "World" Then 

source = "AffiliationSummary" 

Else 

source = "Affiliation Summary" & band 
End If 

uumRows = CountRows(source, I) 

'fill in the header information of the rsquared sheet 
Call rSquaredSheetHeader(numRows - 3, currSheet, band) 

'determine startrow 
startRow = 4 

While Not (i > 0#) 'if the first time step's value is zero find the step that isn't 0 
i = Sheets(source).Cells(startRow, 3).Value 
If Not (i > 0) Then 

startRow = startRow + I 
End If 
Wend 

counter = 6 

For i = startRow To numRows 
'get the month 

j = 1 

While found = False 

testchar= Sheets(source).Cells(i, 2).Characters(j, l).Text 
If testchar = "/" Then 
found = Trae 
Else 

j=j + l 

End If 
Wend 

'month ends at j 
'get the year 

currentMonth = Sheets(source).CeIIs(i, 2).Characters(I, j - I).Text 
If currentMonth <10 Then 

theYear = Sheets(source).CeIIs(i, 2).Characters(5, 2).Text 
Else 

theYear = Sheets(source).Cells(i, 2).Characters(6, 2).Text 
End If 
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'test the year to see if different from before 
If Not the Year = previousYear Then 
startGraphRange = i 
startStep = Sheets(souree).Cells(i, 1) 
Sheets(eurrSheet).Cells(eounter, 2) = startStep 
If the Year > 50 Then 

Sheets(eurrSheet).Cells(eounter, I) = "19" & theYear 
Else 

Sheets(eurrSheet).Cells(eounter, I) = "20" & theYear 
End If 

Call rSquaredGraph(eurrSheet, startGraphRange, eounter, souree) 
eounter = eounter + I 
End If 

'update previous year value 
previousYear = theYear 
found = False 
Next i 

Sheets(eurrSheet).Name = band & " rSquared Power" 

End Sub 'rSquaredSheet 
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rSquaredHeader 
Author: Matt Behnke 
Created 1 / 3 / 2002 

ereates the header eolumns and formatting for the rsquared sheet 


Sub rSquaredSheetHeader(ByVal numSteps As Integer, ByVal rSquaredSheetName As String, ByVal band 
As String) 

Sheets(rSquaredSheetName). Seleet 
Range("Al").Seleet 

AetiveCell.FormulaRlCl = "R-Squared Values for Ada, " & band 
Range(" A3"). Seleet 

AetiveCell.FormulaRlCl = numSteps & " total steps," & numSteps /12 & " years." 

Range("A4"). Seleet 

AetiveCell.FormulaRlCl = "Number of Publieations" 

Range("A5").Seleet 
AetiveCell.FormulaRlCl = "Year" 

Range("B5").Seleet 

AetiveCell.FormulaRlCl = "Starting Step" 

Range("C5").Seleet 

AetiveCell.FormulaRlCl = "Rsquared" 

Range("D5").Seleet 
AetiveCell.FormulaRlCl = "Equation" 

Columns("A:J").Seleet 
Seleetion.ColumnWidth = 13.29 

Columns("D:D").Seleet 
Seleetion.ColumnWidth =18 

Columns("F :F"). Seleet 
Seleetion.ColumnWidth =18 

Range("C5:D5").Seleet 

Seleetion.Copy 

Range("E5").Seleet 

AetiveSheet.Paste 

Range("E4").Seleet 
Applieation.CutCopyMode = False 
AetiveCell.FormulaRlCl = "Entropy" 

Range("A5:I5").Seleet 

Seleetion.Borders(xlDiagonalDown).LineStyle = xlNone 
Seleetion.Borders(xlDiagonalUp).LineStyle = xlNone 
Seleetion.Borders(xlEdgeLeft).LineStyle = xlNone 
Seleetion.Borders(xlEdgeTop).LineStyle = xlNone 
With Seleetion.Borders(xlEdgeBottom) 

.LineStyle = xlContinuous 
.Weight = xlThin 
.Colorindex = xlAutomatie 
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End With 

Selection.Borders(xlEdgeRight).LineStyle = xlNone 
Selection.Borders(xlInsideVertical).LineStyle = xlNone 

Columns("C:C").Select 

With Selection.Borders(xlEdgeLeft) 

•LineStyle = xlContinuous 
•Weight = xlThin 
•Colorlndex = xlAutomatic 
End With 

Columns("E:E").Select 

With Selection.Borders(xlEdgeLeft) 

•LineStyle = xlContinuous 
•Weight = xlThin 
•Colorlndex = xlAutomatic 
End With 

RangeC'Al")^ Select 
Selection^Font^Bold = True 
End Sub 'rsquaredHeader 


Sub: rSquaredGraph 
author: Matt Behnke 
Date: 1/3/2002 

uses a graph to determine the rsqurared value and equation •• 
uses affdiation summary sheets 


Sub rSquaredGraph(ByVal rSquaredSheetName As String, ByVal startGraphRange As Integer, ByVal 
counter As Integer, ByVal source As String) 

'trendType = xlLinear 
trendType = xlPower 

'number of publications 
Charts^Add 

chartName = ActiveChart^Name 
ActiveChart^ChartType = xlLineMarkers 
ActiveChart^SetSourceData source:=Sheets(source)^Range( _ 

"C" & StartGraphRange & ":C255"), PlotBy:=xlColumns 

With ActiveChart 
•HasTitle = False 

•Axes(xlCategory, xlPrimary)^FIasTitle = False 
•Axes(xlValue, xlPrimary)^FIasTitle = False 
End With 


ActiveChart^ SeriesCollection( !)• Select 

ActiveChart^SeriesCollection(l)^Trendlines^Add(Type:=trendType, Forward:=0, 
Backward:=0, DisplayEquation:=True, DisplayRSquared:=True)^Select 

'get trendline rsq and equation for num publications 
ActiveChart^SeriesCollection(l)^ Select 


260 






With ActiveChart. SeriesCollectioii( 1 ).Trendlines( 1) 
trendEq = .DataLabel.Text 
End With 

Sheets(source). Select 
firstPartEq = firstPartTrendEq(trendEq) 
secondPartEq = secondPartTrendEq(trendEq) 
rSquared = rSquaredTrendEq(trendEq) 

If trendType = xlPower Then 

Sheets(rSquaredSheetName).Cells(counter, 4) = "y=" & firstPartEq & "x^" & secondPartEq 
Else 

Sheets(rSquaredSheetName).Cells(counter, 4) = "y=" & firstPartEq & "x + " & secondPartEq 
End If 

Sheets(rSquaredSheetName).Cells(counter, 3) = rSquared 
'entropy 

Sheets(chartName). Select 

If Not Sheets(source).Cells(startGraphRange, 6) > 0 And trendType = xlPower Then 
Sheets(rSquaredSheetName).Cells(counter, 5) = "N/A due to zero entropy" 


Else 

ActiveChart.SetSourceData source~Sheets(source).Range( _ 

"F" & startGraphRange & ":F255"), PlotBy~xlColumns 

'get trendline rsq and equation for entropy 
ActiveChart. SeriesCollection(l). Select 
W ith ActiveChart. SeriesCollection( 1) .Trendlines( 1) 
trendEq = .DataLabel.Text 
End With 

Sheets(source). Select 
firstPartEq = firstPartTrendEq(trendEq) 
secondPartEq = secondPartTrendEq(trendEq) 
rSquared = rSquaredTrendEq(trendEq) 

If trendType = xlPower Then 

Sheets(rSquaredSheetName).Cells(counter, 6) = "y=" & firstPartEq & "x*" & secondPartEq 
Else 

Sheets(rSquaredSheetName).Cells(counter, 6) = "y=" & firstPartEq & "x + " & secondPartEq 
End If 

Sheets(rSquaredSheetName).Cells(counter, 5) = rSquared 
End If 


'delete chart 

Sheets(rSquaredSheetName). Select 
Application. Display Alerts = False 
Sheets(chartName).Delete 
Application. Display Alerts = True 


End Sub 'rSquaredGraph 


261 




Function: rSquaredTrendEq 
Author: Matt Behnke 
Created: 1/3/02 

Description: extracts the rSquared value of a trendline equation 

inputs: trendline equation 

Outputs: firstpart of trendline equation 


Function rSquaredTrendEq(ByVal trendlineEq As String) As Double 

tempStorage = Cells(l, 1) 

Cells(l, 1) = trendlineEq 

i = 1 

While found = False 

testchar = Cells( 1, l).Characters(i, l).Text 
If testchar = "R" Then 
found = True 
Else 
i = i+ 1 
End If 
Wend 

'i = location of R 

'secondpart starts at character i plus 5.. 

'num of characters = location(x) - 5 
'extract 5 characters.. 

rSquaredTrendEq = Cells(l, l).Characters(i + 5, 6).Text 
Cells(l, 1) = temp Storage 

End Function' rSquaredTrendEqu 


Function: CountRows 

Author: ? Revised by: Matt Behnke 

Created: ? 

Revised: 9/10/01 

Description: Counts the rows in the suppiled worksheet and column number 
inputs: sheetName - name of the sheet to count the rows in 
colNum - number of the column to count rows in 
Outputs: number of rows as a double 


Function CountRows(ByVal sheetName As String, ByVal colNum As Integer) As Double 

On Error Resume Next 

Dim currCell As Range, rowNum As Double 

SheetsC" & sheetName).Select 

If IsNumeric(colNum) Then 
Else 

colNum = 1 
End If 

rowNum = 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 
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Do While currCell.Value <> "" 
rowNum = rowNum + 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 
Loop 

CountRows = rowNum - 1 
End Function 'CountRows 


Function: CountCols 

Author: ? Revised by: Matt Behnke 

Created: ? 

Revised: 9/10/01 

Description: Counts the rows in the suppiled worksheet and column number 
inputs: sheetName - name of the sheet to count the columns in 
rowNum - number of the row to count columns in 
Outputs: number of columns as a double 


Function CountCols(ByVal sheetName As String, ByVal rowNum As Integer) As Integer 

On Error Resume Next 

Dim currCell As Range, colNum As Integer 

SheetsC" & sheetName).Select 

If IsNumeric(rowNum) Then 
Else 

rowNum = 1 
End If 
colNum = 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 

Do While currCell.Value <> "" 
colNum = colNum + 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 

Loop 

CountCols = colNum - 1 
End Function 'CountCols 


Function: firstPartTrendEq 
Author: Matt Behnke 
Created: 11/13/01 

Description: extracts the first part of the given POWER trendline equation, works w/ linear 

inputs: trendline equation 

Outputs: firstpart of trendline equation 


Function firstPartTrendEq(ByVal trendlincEq As String) As Double 

tempStorage = Sheets(dataSheet).Cells(l, 1) 
Sheets(dataSheet).Cells(l, 1) = trendlincEq 

i = 1 

attempt = 1 
While found = False 

testchar = Sheets(dataSheet).Cells(l, l).Characters(i, l).Text 
If testchar = "x" Then 
found = True 
Else 
i = i+ 1 
Ifi> 100 Then 
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i = 1 

attempt = attempt + 1 
End If 

If attempt = 5 Then 

MsgBox ("Trend line read error! string: " & trendlineEq) 

End If 
End If 
Wend 

'i = loeation of x 

'firstpart = starts at eharaeter 5 

'num of eharaeters = loeation(x) - 5 

lirstPartTrendEq = Sheets(dataSheet).Cells(l, l).Charaeters(5, i - 5).Text 
Sheets(dataSheet).Cells(l, 1) = temp Storage 

End Funetion ' first part trendline 


Funetion: seeondPartTrendEq 
Author: Matt Behnke 
Created: 11/13/01 

Deseription: extraets the seeond part of the given POWER trendline equation 

inputs: trendline equation 

Outputs: firstpart of trendline equation 


Funetion seeondPartTrendEq(ByVal trendlineEq As String) As Double 

tempStorage = Sheets(dataSheet).Cells(l, 1) 

Sheets(dataSheet).Cells(l, 1) = trendlineEq 

i = 1 

While found = False 

testehar = Sheets(dataSheet).Cells(l, l).Charaeters(i, l).Text 
If testehar = "x" Then 
found = True 
Else 
i = i+ 1 
End If 
Wend 

'i = loeation of x 

'seeondpart starts at eharaeter i plus 1.. 

'num of eharaeters = loeation(x) - 5 
'extraet 5 eharaeters.. 

seeondPartTrendEq = Sheets(dataSheet).Cells(l, l).Charaeters(i + 1, 5).Text 
Sheets(dataSheet).Cells(l, 1) = tempStorage 

End Funetion ' seeondPart eq 


Funetion: seeondPartLinearTrendEq 
Author: Matt Behnke 
Created: 2/5/02 

Deseription: extraets the seeond part of the given linear trendline equation 

inputs: trendline equation 

Outputs: seeondPart of trendline equation 
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Function secondPartLinearTrendEq(ByVal trendlincEq As String) As Double 

tempStorage = Sheets(dataSheet).Cells(l, 1) 

Sheets(dataSheet).Cells(l, 1) = trendlincEq 

i = 1 

While found = False 

testchar = Sheets(dataSheet).Cells(l, l).Characters(i, l).Text 
If testchar = "x" Then 
found = True 
Else 
i = i+ 1 
End If 
Wend 

'i = location of x 

'secondpart starts at character i plus 1.. 

'num of characters = location(x) +6 

'4.143X +2.4441 

* AAAAAAAAA 

'extract 9 characters.. 

secondPartLinearTrendEq = Sheets(dataSheet).Cells(l, l).Characters(i + 1, 9).Text 
Sheets(dataSheet).Cells(l, 1) = tempStorage 

End Function ' secondPart eq 


Function: firstPartPolyTrendEq 
Author: Matt Behnke 
Created: 2/1/02 

Description: extracts the first part of the given trendline equation 
form ax2 + bx + c 
inputs: trendline equation 
Outputs: firstpart of trendline equation 


Function firstPartPolyTrendEq(ByVal trendlineEq As String) As Double 

tempStorage = Sheets(dataSheet).Cells(l, 1) 
Sheets(dataSheet).Cells(l, 1) = trendlineEq 

i = 1 

While found = False 

testchar = Sheets(dataSheet).Cells(l, l).Characters(i, l).Text 
If testchar = "x" Then 
found = True 
Else 
i = i+ 1 
End If 
Wend 

'i = location of x 

'firstpart = starts at character 5 

'num of characters = location(x) - 5 
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firstPartPolyTrendEq = Sheets(dataSheet).Cells(l, l).Characters(5, i - 5).Text 
Sheets(dataSheet).Cells(l, 1) =tempStorage 


End Function ' first part poly order - 2 trendline 


Function: secondPartPolyTrendEq 
Author: Matt Behnke 
Created: 2/1/02 

Description: extracts the second part of a second order polygonal trendline the given trendline equation 

inputs: trendline equation 

Outputs: firstpart of trendline equation 


Function secondPartPolyTrendEq(ByVal trendlineEq As String) As Double 

tempStorage = Sheets(dataSheet).Cells(l, 1) 

Sheets(dataSheet).Cells(l, 1) = trendlineEq 

i = 1 

While found = False 

testchar = Sheets(dataSheet).Cells(l, l).Characters(i, l).Text 
If testchar = "x" Then 
found = True 
Else 
i = i+ 1 
End If 
Wend 

j =i+ 1 

While found2 = False 

testchar = Sheets(dataSheet).Cells(l, l).Characters(j, l).Text 
If testchar = "x" Then 
found2 = True 
Else 

j=j + l 

End If 
Wend 

'i = location of first x 

'j = location of second x 

'secondpart starts at character i plus 5.. 

' il2345 jl234 

'1.0000x2+ 2.001X +8.878 
'num of characters = j - i + 5 


SecondPartPolyTrendEq = Sheets(dataSheet).Cells(l, l).Characters(i + 5, j - (i + 5)).Text 
Sheets(dataSheet).Cells(l, 1) = tempStorage 

End Function ' secondPart poly eq 


Function: thirdPartPolyTrendEq 
Author: Matt Behnke 
Created: 2/1/02 

Description: extracts the third part of a second order polygonal trendline the given trendline equation 
inputs: trendline equation 
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' Outputs: firstpart of trendline equation 


Function thirdPartPolyTrendEq(ByVal trendlineEq As String) As Double 

tempStorage = Slieets(dataSlieet).Cells(l, 1) 

Sheets(dataSlieet).Cells(l, 1) = trendlineEq 

i = 1 

While found = False 

testchar = Sheets(dataSheet).Cells(l, l).Characters(i, l).Text 
If testchar = "x" Then 
found = True 
Else 
i = i+ 1 
End If 
Wend 

j =i+ 1 

While found2 = False 

testchar = Sheets(dataSheet).Cells(l, l).Characters(j, l).Text 
If testchar = "x" Then 
found2 = True 
Else 

j=j + l 

End If 
Wend 

'i = location of first x 

'j = location of second x 

'secondpart starts at character i plus 5.. 

' il2345 jl234 

'1.0000x2+ 2.001X +8.878 

'third part starts at character j plus 4.. 

'extract 5 characters.. 

thirdPartPolyTrendEq = Sheets(dataSheet).Cells(l, l).Characters(j + 4, 5).Text 
Sheets(dataSheet).Cells(l, 1) = tempStorage 

End Function ' thirdPart poly eq 


Function: cols 
Author: Matt Behnke 
Created: 9/11/01 

Description: changes column number into a letter, 
inputs: columnNumber 
Outputs: column letter 


Function col(ByVal columnNumber As Integer) As String 

Select Case columnNumber 
Case 1 
col = "A" 

Case 2 
col = "B" 
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Case 3 
col = "C" 
Case 4 
col = "D" 
Case 5 
col = "E" 
Case 6 
col = "F" 


Case 236 
col = "IB" 

Case 237 
col = "IC" 

Case 238 
col = "ID" 

Case 239 
col = "IE" 

Case 240 
col = "IF" 

Case 241 
col = "IG" 

Case 242 
col = "IH" 

Case 243 
col = "11" 

Case 244 
col = "IJ" 

Case 245 
col = "IK" 

Case 246 
col = "IE" 

Case 247 
col = "IM" 

Case 248 
col = "IN" 

Case 249 
col = "lO" 

Case 250 
col = "IP" 

Case 251 
col = "IQ" 

Case 252 
col = "IR" 

Case 253 
col = "IS" 

Case 254 
col = "IT" 

Case 255 
col = "lU" 

Case others 
col = "Z" 

End Select 
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End Function 'col 


Public Function Update_Mathcad_Band_Stats(ByVal mathcad sheet name As String, _ 
ByVal data sheet name As String, ByVal start cell x As Variant, _ 

ByVal start cell y As Variant, ByVal num rows As Integer, ByVal fit type As Integer, 
ByVal tolerance As Double) As Variant 


Function: Update Mathcad Band Stats 
Author: Aaron Micyus 
Last Modified: 12/05/2001 

Description: Given location information for input data this subroutine 
passes data to embedded mathcad object for processing and returns 
obtained values 
inputs: 

mathcad sheet name : This is the name of the sheet the embedded 
object is in 

data sheet name : This is the name of the sheet we will obtain data 
from 

start cell x : This is the cell location we start getting x data from 
start cell y : This is the cell location we start getting y data from 
num rows : This is the number of data rows we have 
fit type : integer value corresponding to fit type to return 
0 - Flyperbolic 3 parameter (k,p,r) 

1 - Exponential 3 parameter (k,p,r) 

2 - Power 2 parameter (b,m) 
tolerance : tolerance value for fit 

Outputs: 

array : returned array will hold calculated k,p,r values 

[1] element one : k 

[2] element two : r 

[3] element three: p 

[4] element four : Rsquared value 


'VARS 

Dim Mathcad As Object 'our interface to the Mathcad 

'embedded object 

Dim data x real, data x imag As Variant 'vars for real and imaginary 

'components of x data 

Dim data y real, data y imag As Variant 'vars for real and imaginary 

'components of y data 

Dim tolerance real, tolerance imag As Variant 'vars for real and imaginary 

'components of tolerance 

Dim k real, k imag As Variant 'vars for real and imag 

'k values from mathcad 

Dim r real, r imag As Variant 'vars for real and imag 

'r values from mathcad 

Dim p real, p imag As Variant 'vars for real and imag 

'p values from mathcad 
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Dim b real, b imag As Variant 'vars for real and imag 

'b values from mathead 

Dim m real, m imag As Variant 'vars for real and imag 

'm values from mathead 

Dim rsquared real, rsquared imag As Variant 'vars for real and imag 

'r squared values from mathead 

Dim eurrent_ehar_position As Variant 'temp var to hold position in string 

Dim range x, range y As Variant 'vars for ealeulated ranges 

Dim fit_results(4) As Variant 'array to hold returned fit data from mathead 


'initialize embedded mathead 

Call Register_]Vlathead_OLE(mathead_sheet_name) 

'aetivate the sheet with the embedded mathead objeet 
W orksheets(mathead_sheet_name) .Aetivate 

'get objeet referenee 

Set Mathead = AetiveSheet.OLEObjeets(l).Objeet 

'aetivate the sheet with the data 
W orksheets(data_sheet_name) .Aetivate 

""""'eonstruet the x value range 

'this temp variable holds eurrent position in string we are parsing through 
eurrent_ehar_position = 1 

'traverse the string until we find a numerie eharaeter 

While Not IsNumerie(Mid(start_eell_x, eurrent_ehar_position, 1)) 

eurrent_ehar_position = eurrent_ehar_position + 1 

Wend 

'ealeulate range string for x 

range x = start eell x & & Left(start_eell_x, 1) 

range x = range x & (Right(start_eell_x, (Len(start_eell_x) - current_ehar_position + 1)) + num rows - 

1 ) 


""""""now eonstruet y value range 

'this temp variable holds current position in string we are parsing through 
current_char_position = 1 

'traverse the string until we find a numeric character 

While Not IsNumeric(Mid(start_cell_y, current_char_position, 1)) 

current_char_position = current_char_position + 1 

Wend 

'calculate range string for y 

range y = start cell y & & Left(start_cell_y, 1) 
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range y = range y & (Right(start_cell_y, (Len(start_cell_y) - current_char_position + 1)) + num rows - 

1 ) 


'set ranges for input data for mathead 
dataxreal = AetiveSheet.Range(range_x).Value 
dataximag = Empty 

datayreal = AetiveSheet.Range(range_y).Value 
datayimag = Empty 


toleranee real = toleranee 'obtained from parameter 

toleraneeimag = Empty 

'import values into mathead 

Call Mathead.SetComplex("X_in", data x real, data x imag) 

Call Mathead.SetComplex("Y_in", data y real, data y imag) 

Call Mathead.SetComplex("eTOL", toleranee real, toleranee imag) 

'have mathead reealeulate sheet 
Call Mathead. Reealeulate 

If fit type = HYP3_FIT Then 

'get values from mathead for exeel 

Call Mathead.GetComplex("outO", k real, k imag) 

Call Mathead.GetComplex("outl", r real, r imag) 

Call Mathead.GetComplex("out2", p real, p imag) 

Call Mathead.GetComplex("out3", rsquared real, rsquared imag) 

'fdl array with results 
fit_results(l) = kreal 
fit_results(2) = rreal 
fit_results(3) = preal 
fit_results(4) = rsquaredreal 

Elself fit type = EXP3_FIT Then 

'get values from mathead for exeel 

Call Mathead.GetComplex("out4", k real, k imag) 

Call Mathead.GetComplex("out5", r real, r imag) 

Call Mathead.GetComplex("out6", p real, p imag) 

Call Mathead.GetComplex("out7", rsquared real, rsquared imag) 

'fill array with results 
fit_results(l) = kreal 
fit_results(2) = rreal 
fit_results(3) = preal 
fit_results(4) = rsquaredreal 

Elself fit type = POW2_FIT Then 

'get values from mathead for exeel 

Call Mathead.GetComplex("out8", b real, b imag) 

Call Mathead.GetComplex("out9", m real, m imag) 

Call Mathead.GetComplex("outlO", rsquared real, rsquared imag) 
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'fill array with results 
fit_results(l) = breal 
fit_results(2) = mreal 
fit_results(3) = Empty 
fit_results(4) = rsquaredreal 

End If 

UpdateMathcadBandStats = fitresults 

'end of Update Mathead Band Stats 

End Funetion 

Publie Funetion Register_Mathead_OLE(ByVal mathead sheet name As String) 


' register mathead ole Maero 

' opens embedded mathead document in order for system to recognize it for future macro 


Sheets(mathcad_sheet_name). Select 
Range("Al"). Activate 
ActiveSheet. Shapes("Obj ect 1"). Select 
Selection.Verb Verb:=xlPrimary 
Range("Al"). Activate 
'Sheets("A_Band_Stats").Select 
End Function 


sub: fillMonthsCol 
Author: Matt Behnke 
Created: 12/11/01 

Description: fills in the months if they are missing.. Inserts a column used for a matrix sheet 
this doesnt work because there are not enough columns 
inputs: sheetName 
Outputs: 


Sub fillMonthsCol()' ByVal sheetName As String) 

sheetName = ActiveSheet.Name 
numColumns = CountCols(sheetName, 1) 

Dim thcMonth As Integer 

counter = 1 

monthCounter = "" & counter 
For i = 4 To numColumns 


j = 1 

While found = False 

testchar = Cells(l, i).Characters(j, l).Text 
If testchar = "/" Then 
found = True 
Else 

j=j + l 

End If 
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Wend 

'month ends at j 

currentMonth = Cells(l, i).Characters(l, j - l).Text 
If currentMonth <10 Then 
restofDate = Cells(l, i).Characters(2, 5).Text 
Else 

restofDate = Cells(l, i).Characters(3, 5).Text 
End If 

theMonth = currentMonth 
If theMonth > monthCounter Then 
While theMonth > monthCounter 
Range(Cells(l, i), Cells(l, i)).Select 
Selection.EntireColumn.Insert 

'copy previous column 

Columns(col(i - 1) & & col(i - 1)).Select 

Selection.Copy 

Columns(col(i) & & col(i)).Select 

Active Sheet. Paste 

Cells(l, i) = & monthCounter & restofDate 

counter = counter + 1 
If counter =13 Then 
counter = 1 
End If 

monthCounter = "" & counter 
i = i + 1 

numColumns = numColumns + 1 
Wend 
End If 


counter = counter + 1 
If counter =13 Then 
counter = 1 
End If 

monthCounter = "" & counter 
found = False 
Next i 

Columns("D:D").ColumnWidth = 6.29 
End Sub ' fdl months col 
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sub: fillMonthsRow 
Author: Matt Behnke 
Created: 12/11/01 

Deseription: fills in the months if they are missing.. Inserts a row 
works on lists.... must run 2-3 times to ensure all filled., 
inputs: sheetName 
Outputs: 


Sub fillMonthsRow(ByVal sheetName As String, ByVal startRow As Integer)' ByVal sheetName As 
String) 

Dim numRows As Integer 
Dim theYear As Integer 
Dim theMonth As Integer 
Dim previousYear As Integer 
Dim previousMonth As Integer 
Dim monthCounter As String 
Dim i As Integer 

Sheets(sheetName). Seleet 
numColumns = CountCols(sheetName, I) 
numRows = CountRows(sheetName, I) 


i = startRow 
eounter = I 

monthCounter = "" & eounter 
While Not i = numRows + I 


j = 1 

While found = False 

testehar = Cells(i, 2).Charaeters(j, I).Text 
If testehar = "/" Then 
found = True 
Else 

j=j + l 

If j > 100 Then 

j = l 

test = test -I-1 
If test > 5 Then 
shit = I 
Exit Sub 
End If 
End If 
End If 
Wend 

'month ends at j 

eurrentMonth = Cells(i, 2).Charaeters(I, j - I).Text 
If eurrentMonth <10 Then 

the Year = Cells(i, 2).Charaeters(5, 4).Text 
Else 

theYear = Cells(i, 2).Charaeters(6, 4).Text 
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End If 


theMonth = currentMonth 

If i = startRow Then 
monthCounter = theMonth 
counter = theMonth 
End If 

If the Year = previous Year + I And theMonth =12 And previousMonth = 11 Then 
'case ex: II/I/I988 - > I2/I/I989 
'need to fdl in I2/I/I987 -> I/I/I989 

Range(CeIIs(i, I), CeIIs(i, I)).Select 
Selection.EntireRow.Insert 
Rows(i - I &":"&i - I).SeIect 
Selection.Copy 
Rows(i & & i).Select 

Active Sheet. Paste 

Cells(i, 2) = '"12/1/" & previousYear 
Cells(i, 1) = i - StartRow + 1 
i = i+ 1 
i = i- 1 

numRows = numRows + 1 
End If 


If theMonth < monthCounter Then ' And monthCounter <12 Then ' a new year has begun before a 
previous year finished.. 

Do While Not theMonth = monthCounter 
If i = startRow Then 

Else 

If monthCounter <11 Then 

restofDate = Cells(i - 1, 2).Characters(2, 7).Text 
' theYear = Cells(l, i).Characters(5, 4).Text 
Else 

restofDate = Cells(i - 1, 2).Characters(3, 7).Text 
' theYear = Cells(l, i).Characters(6, 4).Text 
End If 


Range(Cells(i, 1), Cells(i, 1)).Select 
Selection.EntireRow.Insert 

'copy previous row 
Rows(i - 1 & & i - l).Select 

Selection.Copy 
Rows(i & & i).Select 

Active She et. Paste 

Cells(i, 2) = '"" & monthCounter & restofDate 
End If 'i=startrow 
Cells(i, 1) = i - StartRow + 1 
counter = counter + 1 
If counter =13 Then 
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counter = 1 
End If 

monthCounter = "" & counter 
i = i+ 1 

numRows = numRows + 1 

previous Year = the Year 
previousMonth = theMonth 

If theMonth > monthCounter Then 
Exit Do 
End If 
Loop 
End If 


If theMonth > monthCounter Then 
While theMonth > monthCounter 
If i = startRow Then 
'do nothing 
Else 

If currentMonth <10 Then 
restofDate = Cells(i, 2).Characters(2, 7).Text 
Else 

restofDate = Cells(i, 2).Characters(3, 7).Text 
End If 

Range(Cells(i, 1), Cells(i, 1)).Select 
Selection.EntireRow.Insert 


'copy previous row 
Rows(i - 1 &":"&i - l).Select 
Selection.Copy 
Rows(i & & i).Select 

ActiveSheet.Paste 

Cells(i, 2) = & monthCounter & restofDate 

End If 'i=startrow 

Cells(i, 1) = i - StartRow + 1 
counter = counter + 1 
If counter =13 Then 
counter = 1 
End If 

monthCounter = "" & counter 
i = i+ 1 

numRows = numRows + 1 
Ifi>50 Then 
a = 1 
End If 

Wend 
End If 


Cells(i, 1) = i - StartRow + 1 
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counter = counter + 1 
If counter =13 Then 
counter = 1 
End If 

monthCounter = "" & counter 
found = False 
i = i+ 1 

previous Year = the Year 
previousMonth = theMonth 

Wend 

End Sub ' fdl months rows 
Sub testFillMonthsRowsO 

Call ftllMonthsRow("World_Stats", 14) 


End Sub 

Sub fillMonthsRowTriggerO 


If testBandA > 0 Then 

Call fillMonthsRow("A_Band_Stats", 14) 

Call fillMonthsRow("Affdiation_Summary_A_Band", 4) 

End If 

If testBandB > 0 Then 

Call frllMonthsRow("B_Band_Stats", 14) 

Call fillMonthsRow("Affdiation_Summary_B_Band", 4) 

End If 

If testBandC > 0 Then 
Call ftllMonthsRow("C_Band_Stats", 14) 

Call fillMonthsRow("Affdiation_Summary_C_Band", 4) 

End If 

If testBandD > 0 Then 

Call ftllMonthsRow("D_Band_Stats", 14) 

Call fillMonthsRow("Affiliation_Summary_D_Band", 4) 

End If 

Call ftllMonthsRow("World_Stats", 14) 


Call fillMonthsRow("Affihation_Summary", 4) 
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Call fillMonthsRow("Entropy Summary", 4) 
End Sub 


sub v_calc_v_psi_slieet() 

Author: Matt Behnke 
Created: 2/22/02 

Deseription: ereates a sheet either by band or world that has the result of v and psi. 

where v = (num reeords at an instanee in step k)/(total num of reeords at step k) 


(num of authors at an instanee in step k)/(total num of authors at step k) 
psi (tasks per timestep on avg) = v / timestep 
inputs: band - the souree.. 


Sub v_eale_v_psi_sheet(ByVal band As String) 

'Dim authorMatrixTotals As Variant 
'Dim affdiationMatrixTotals As Variant 
Dim sum v array As Variant 

'N i k = the number of reeords produeed by affdiation i at timestep k 
'N Total k = the number of reeords produeed by all affiliations at timestep k 
'P_i_k = the number of authors who published in affdiation i at timestep k 
'P Total k = the number of authors who published in all affdiations at timestep k 

authorMatrixTotals = Array() 
affdiationMatrixTotals = Array() 

Sheets. Add After:=Worksheets(Worksheets.Count) 

'Sheets(Worksheets.Count).Seleet 

ActiveSheet.Name = "v ealeulation" & band 
eurrentSheetName = AetiveSheet.Name 

'move the sheet so it is by related sheets 
' Sheets(currentSheetName).Move Before:=Sheets("" & band & " Stats") 

If band = "World" Then 

affiliationMatrix = datasheet 
authorMatrix = "Affdiationauthors" 

Else 

affiliationMatrix = "Affiliation Cum Dist" & band 
authorMatrix = "Aff Author Cum Dist" & band 
End If 

numRowsInAffiliationMatrix = CountRows(affihationMatrix, I) 
numCoIsInAffdiationMatrix = CountCoIs(affihationMatrix, I) 

numRowsInAuthorMatrix = CountRows(authorMatrix, I) 
numCoIsInAuthorMatrix = CountCoIs(authorMatrix, I) 
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'ReDim affiliationMatrixTotals(4 To numColsInAffiliationMatrix) 

'ReDim authorMatrixTotals(4 To numColsInAuthorMatrix) 

ReDim sum_v_array(4 To numColsInAffiliationMatrix) 

'headers 

Sheets(eurrentSheetName).Cells(l, 1) = "" 

Sheets(eurrentSheetName).Cells(l, 2) = "" 

Sheets(eurrentSheetName).Cells(3, 1) = "Time Step" 

Sheets(eurrentSheetName).Cells(3, 2) = "irrterval" 

Sheets(eurrentSheetName).Cells(3, 3) = "v" 

Sheets(eurrentSheetName).Cells(3,4) = "psi" 

For i = 2 To uumRowsInAuthorMatrix 
For j = 4 To numColsInAffiliationMatrix 
If i = 2 Then 
timeStep = j - 3 

interval = Sheets(authorMatrix).Cells(l, j) 

Sheets(eurrentSheetName).Cells(j, 1) = timeStep 'the timestep 
Sheets(eurrentSheetName).Cells(j, 2) = interval 'the timestep 
End If 

affiliationNameFromAuthorMatrix = Sheets(authorMatrix).Cells(i, 3) 

'find the row that contains the affiliation name from authors matrix in the affiliation matrix 
'sheet 

N i k row = findStringInSheet(affiliationMatrix, affiliationNameFromAuthorMatrix, "C") 

'temp = Sheets(datasheet).Cells(l, 1) 

'Sheets(dataSheet).Cells(l, 1) = N_i_k_range 

'N_i_k_row= Sheets(dataSheet).Cells(l, l).Characters(4, 5).Text 

'Sheets(dataSheet).Cells(l, 1) = temp 

Ifj > 4 Then 'find the values at that instance... not cumulative 

P_i_k = Sheets(authorMatrix).Cells(i, j) - Sheets(authorMatrix).Cells(i, j - 1) 

N i k = Sheets(affiliationMatrix).Cells(N_i_k_row, j) - 
Sheets(affiliationMatrix).Cells(N_i_k_row, j - 1) 

N total = Sheets(affiliationMatrix).Cells(numRowsInAffiliationMatrix + 4, j) _ 

- Sheets(affiliationMatrix).Cells(numRowsInAffiliationMatrix + 4, j - 1) 

P total = Sheets(authorMatrix).Cells(numRowsInAuthorMatrix + 4, j) _ 

- Sheets(authorMatrix).Cells(numRowsInAuthorMatrix + 4, j - 1) 

Else 

P_i_k = Sheets(authorMatrix).Cells(i, j) 

Nik = Sheets(affiliationMatrix).Cells(N_i_k_row, j) 

Ntotal = Sheets(affiliationMatrix).Cells(numRowsInAffiliationMatrix + 4, j) 

Ptotal = Sheets(authorMatrix).Cells(numRowsInAuthorMatrix + 4, j) 

End If 
'calculate v 

' where v = (num records at an instance in step k)/(total num of records at step k) 


' (num of authors at an instance in step k)/(total num of authors at step k) 

If P_i_k And N total > 0 Then 

V = (((N_i_k / N_total) / P_i_k) / P_total) 

Else 
v = 0 
End If 
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sum_v_array(j) = sum_v_array(j) + v 
'debug*****8 

Sheets(curreiitSheetName).Cells(i + 2, j + 4) = sum_v_array(j) 
Sheets(curreiitSheetName).Cells(l, 2) = j 
'debug 
Next j 

Sheets(currentSheetName).Cells(l, 1) 

Next i 

For j = 4 To uumColsInAffdiationMatrix 
'output array of sum v.. make cumulative 
time Step = j - 3 


Ifj > 4 Then 'cumulative 

Sheets(currentSheetName).Cells(j, 3) = sum_v_array(j) + Sheets(currentSheetName).Cells(j -1,3) 
Else 

Sheets(currentSheetName).Cells(j, 3) = sum_v_array(j) 

End If 

psi = Sheets(currentSheetName).Cells(j, 3) / timeStep 

Sheets(currentSheetName).Cells(j, 4) = psi 
Next] 

End Sub 'calc_v_psi_sheet 


sub clearArrayO 
Author: Matt Behnke 
Created: 2/22/02 

Description: clears the values stored in an array. 

inputs: lowerBound - lowerbound of the array 
upperBound - upperbound of the array 
arrayName - the array 


Sub clearArray(ByVal lowerBound As Integer, ByVal upperBound As Integer, ByVal arrayName As 
Variant) 

For i = lowerBound To upperBound 
arrayName(i) = "" 

Next i 

End Sub 


function LinearInterpolation() 

Author: Matt Behnke 
Created: 2/26/02 

Description: linear interpolation used to calculate missing data: 
[ X_i - X low] 

Y_i = y low -I- [-] * (Y_hi - Y low) 

[X_hi - X low] 
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inputs: Y low - the closest "real" value to the left of the missing value 
Y high - the closest "real" value to the right of the missing value 
X low - the closest time step that has data to the left of the missing value 
X high - the closest time step that has data to the right of the missing value 
X_i - the time step that has the missing data., 
output: returns a value, Y_i, for the missing time step. 


Function LinearInterpolation(ByVal Y low As Double, ByVal Y high As Double, ByVal X low As 
Integer, _ 

ByVal X high As Integer, ByVal X_i As Integer) As Double 
Linearinterpolation = Y low + ((X_i - X low) / (X high - X low)) * (Y high - Y low) 

End Function 


sub: FillInMissingDataO 
Author: Matt Behnke 
Created: 2/26/02 

Description: Stores a list of data in an array, traverses the array to find 

points where the data doesn't change. In our case where nothing was added 
due to lack of information (small holes in the dataset). 

When an element that doesn't change is found a linearinterpolation is performed 

to determine what the value should be. 

the value is changed and marked in red. <— not added? 

inputs: datasheet - the source of the data. 

columnNumber - the column that contains the data 
startRow - the row number where the data starts 
endRow - the row number where the data ends 


Sub FillInMissingData(ByVal datasheet As String, ByVal columnNumber As Integer, ByVal startRow As 
Integer, _ 

ByVal endRow As Integer) 

Dim dataArray As Variant 
dataArray = Array() 

numTimeSteps = endRow - startRow + 1 

ReDim dataArray(l To numTimeSteps) 

' numTimeSteps (k) = 5 
' arrayindex (a) = 0 to 5 
' startrow = 3 
' endrow = 7 
' k a row 
' 1 1 3+0=3 
'2 2 3+1=4 
'3 3 3+2=5 
'44 3+3=6 
' 5 5 3+4=7 
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'populate the array with data 
For i = 1 To numTimeSteps 

dataArray(i) = Sheets(dataSheet).Cells(startRow + i - 1, eolumuNumber) 

Next i 

'analyze the array: 

For i = 1 To numTimeSteps 
If Not i = numTimeSteps Then 
eurrentValue = dataArray(i) 
nextValue = dataArray(i +1) 

If eurrentValue = nextValue Then 

lowestDifferentValue = dataArray(i) 'YIow 

lowestDifferentValuePosition = i 'XIow 

For j = i + 2 To numTimeSteps 'sean the array to find the next higher value 
nextHigherValue = dataArray(j) 'Yhigh 

If Not nextHigherValue = eurrentValue Then 
nextHigherValuePosition = j 'Xhigh 

Exit For 'j 
End If 
Next j 

For k = lowestDifferentValuePosition + 1 To nextHigherValuePosition - 1 
'now the next lower and higher values are known along with their positions,eall 
linearlnterpolation 
'k = y_i 

dataArray(k) = LinearInterpolation(lowestDifferentValue, nextHigherValue, 
lowestDifferentValuePosition, nextHigherValuePosition, k) 

Next k 

End If'eurrentValue = nextValue 
End If 'not equal to numTimesteps 
Next i 

'output the array 

For i = I To numTimesteps 

Sheets(dataSheet).Cells(startRow + i - 1, eolumuNumber) = dataArray(i) 

Next i 

End Sub 'fdl in missing data 

'TEST FILL IN MISSING DATA 
Sub testFilllnMissingO 

'tests the linear interpolation funetion.xan also be used as an interfaee to the funetion... 

datasheet = InputBox("enter the name of the souree sheet") 
eolumuNumber = InputBox("enter eolumn number") 
rowStart = InputBox("enter the first row of data") 
rowEnd = InputBox("enter the last row of data") 

Call FillInMissingData(dataSheet, eolumuNumber, rowStart, rowEnd) 


End Sub 


282 



CurveFit 

Author: Erchuang (Al) Wang (original), Matt Behnke - converted to VB 

Converted: 2/27/02 

Description: 

This program will fit a curve up to 10th degree polynomial 
in the form of Y = aO + al*x + a2*x^2 + ... + a(n)*x^(n) 
where n is the degree of the polynomial and l>=n=<30 
Reads in two lists of numbers, X & Y-values and performs the fit 

inputs: datasheet - sheet with the source values 

numValues - the number of values in the array 
X - array of the x values 
y - array of the y-values 
degree - the degree of the polynomial 
Outputs: CoefficientArray - the coefficients of the equation aO, al, ... a(n) 


Function curveFit(ByVal numValues As Integer, _ 

ByVal X As Variant, ByVal y As Variant, ByVal degree As Integer) As Variant 

'numValues = endRow - startRow -I- 1 

'variables 

Dim coefficientArray(64) As Variant 'stores the results, max of 64 coeff. 

'Dim X As Variant 'a one dimension array for x values 

'Dim y As Variant 'a one dimension array for y values 

Dim cn(64) As Variant '??????????????????????????? 

Dim ar(64, 64) As Variant 'a two dimension array 

Dim an(64, 64) As Variant 'answer array 

Dim sum As Double 
Dim t As Double 
Dim d As Double 
Dim b As Double 

Dim j As Integer 
Dim i As Integer 
Dim m As Integer 
Dim n As Integer 
Dim ii As Integer 
Dim k As Integer 
Dim nn As Integer 
Dim nd As Integer 

n = numValues 
nd = degree 
m = nd -I- 1 
nn = m -I- 1 

'generate normal equation A and vector B of Ax=B 
For ii = 1 To n 
For j = 1 To m 

If j = 1 And x(ii) = 0# Then 
ar(ii, j) = 1# 

Else 


'for loop counter 
'for loop counter 

'=numValues, number of data points 
'for loop counter 
'for loop counter 

'^degree, degree of poly 
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ar(ii, j) = x(ii) ^ G - 1) 

End If 
Next j 
Next ii 

For k = 1 To m 
For ii = 1 To m 
sum = 0# 

For j = 1 To n 

sum = sum + ar^, k) * ar^, ii) 
Next] 

an(k, ii) = sum 
Next ii 
Next k 

For ii = 1 To m 
sum = 0# 

For j = 1 To n 

sum = sum + yG) * arG, ii) 
Next) 

en(ii) = sum 
Next ii 

'solve X veetor of Ax=B where A=A' 

For i = 1 To m 
an(i, nn) = en(i) 

Next i 

For i = 1 To m 
k = i 

b = Abs(anG, i)) 

If b = 0# Then 
For j = i To m 
If b < Abs(anG, ii)) Then 
b = Abs(anG, i)) 
k = j 
End If 
Next) 

For j = 1 To nn 
t = an(i, j) 
anG, j) = an(k, j) 
an(k, j) = t 
Next) 

Else 

d = an(i, i) 

End If 

For j = 1 To nn 
anGJ) = anG,j)/d 
Next) 


For j = 1 To m 
b = anG, i) 
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For k = 1 To nn 
If Not j = i Then 

an(j, k) = an(j, k) - an(i, k) * b 
End If 
Next k 
Next j 
Next i 

'put answers into the coeffieient array 
For ii = I To m 

coefficientArray(ii) = an(ii, nn) 

Next ii 

curveFit = coefficientArray 
End Function 'curvefit 
Sub testFindO 
'WORKS'... 

"Phys. Dept., Kakatiya Univ., Warangal, India 
With ActiveSheet.Range("A:A") 

Set C = .Find("3/I979", LookIn:=xlVaIues) 

If Not C Is Nothing Then 
firstAddress = C.Address 
MsgBox (firstAddress) 

End If 
End With 

'MsgBox (Search("Phys. Dept., Kakatiya Univ., Warangal, India")) 
End Sub 
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APPENDIX O - ENTROPY LAMBDA MACRO CODE 


' Macro: EntropyLambda 

' Description: Computes entropy based on lambda and lyapunov exponent 

' Shows "mindshare" by dividing beta by author 

» 

Dim currFilename As String 
Dim betaPercentage As Double 

Sub runEntropyLambda() 

Dim sourceSheet As String 
sourceSheet = "AffiliationSummary" 

currFilename = Application.ActiveWorkbook.Name 

betaPercentage = 0.05 


Call EntropyLambda(sourceSheet) 

Call Correlate_S_Lambda("EntropyLambda") 

End Sub 


Subroutine: entropyLambda 
Author: Matt Behnke 
Created: 9/19/01 

Revised: 9/24 - added map k, k+1 stuff 

2/14/02 - made it a separate macro that uses the "Affdiation Summary" sheet as 
its input. 

Description: 1) Creates a sheet called "entropy lambda" where lambda and the Lyapunov 
number is calculated based on formulas given in the requirements 
inputs: sourceSheet.. uses the affiliation summary sheet. 

Outputs: none 


Sub EntropyLambda(ByVal sourceSheet) 

numRows = CountRows(sourceSheet, 1) 'get the num of rows in the source sheet 
startRange = 4 

'create new sheet 
Sheets.Add 

currentName = ActiveSheet.Name 
Sheets("" & currentName).Name = 
currentName = ActiveSheet.Name 

'set height of header row and standard column width 
Rows("l:l").RowHeight = 52 '(R12.6) 

ActiveSheet.StandardWidth = 12 '(R12.5) 
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'(R6.1) 

("EntropyLambda") 






'headers '(R6.3) 

AetiveSheet.Cells(l, 1) = "" 

AetiveSheet.Cells(2, 1) = "" 

AetiveSheet.Cells(3, 1) = "TimeStep (k)" 

AetiveSheet.Cells(3, 2) = "month/year" 

AetiveSheet.Cells(3, 3) = "Reeords (eumulative sum)" 

AetiveSheet.Cells(3, 4) = "Entropy (S)" 

AetiveSheet.Cells(3, 5) = "S(k+1)" 

AetiveSheet.Cells(3, 6) = "du_(t-e)" 

AetiveSheet.Cells(3, 7) = "du_(t)" 

AetiveSheet.Cells(3, 8) = "B_y_10%" 

AetiveSheet.Cells(3, 9) = "Lambda_B10%_y" 

Aetive Sheet.Cells(3, 10) = "beta" 

AetiveSheet.Cells(2, 10) = betaPercentage 

'fill in eolumn 5: Cum_K+l '(R6.10) 

For i = startRange To numRows 

Sheets(eurrentName).Cells(i, 1) = Sheets(soureeSheet).Cells(i, 1) 

Sheets(eurrentName).Cells(i, 2) = Sheets(soureeSheet).Cells(i, 2) 

Sheets(eurrentName).Cells(i, 3) = Sheets(soureeSheet).Cells(i, 3) 

Sheets(eurrentName).Cells(i, 4) = Sheets(soureeSheet).Cells(i, 8) 

Sheets(eurrentName).Cells(i, 5) = "=D" & i + 1 
Next i 

'ereate the map of k, k+1 to get the trendline equation for '(R6.11) 

'ealeulating the lyanponuv exponent. 

Call CopyMessagesGraph(startRange, numRows, currentName) 

Call CopyMapEntropyKK l (startRange, numRows, eurrentName) 

Call CopyEntropyLambdaChart(startRange, numRows, eurrentName, False) 

' After the three graphs are ealled the following trendlines are plaeed onto the 
' entropy lambda sheet: 

' eells(l,6) eontains the trendline equation of messages (N, Reeords) 

' eells(l,7) eontains the trendline equation of entropy 
' eells(l,8) eontaint the tren of lambda 

'get formula of trendline from entropy power trend graph 
trendEq_entropy = Sheets(eurrentName).Cells(l, 7) 
ms = firstPartTrendEq(trendEq_entropy) 
b_s = seeondPartTrendEq(trendEq_entropy) 

'get formula of trendline from the messages graph 
trendEq_messages = Sheets(eurrentName).Cells(l, 6) 
firstPartmessages = firstPartTrendEq(trendEq_messages) 
seeondPartmessages = seeondPartTrendEq(trendEq_messages) 

For i = startRange + 1 To numRows 
stepK = Sheets(eurrentName).Cells(i, 1) 

Sheets(eurrentName).Cells(i, 6) = firstPart messages 'the derivative of the msgs trendlineEq 
Sheets(eurrentName).Cells(i, 7) = (m s * b_s) * (stepK ^ (b_s - 1)) 

Sheets(eurrentName).Cells(i, 10) = "=J2" 

Sheets(eurrentName).Cells(i, 8) = "=D" & i & "-I" & i 'eells(i,4) - eells (i, 9) 
Sheets(eurrentName).Cells(i, 9) = "=((J" & i & "*F" & i & ")/((J" & i & "*F" & i & ")+G" & i & 
"))^(l/3)" 

'(eells(i,10) * eells(i, 6))/((eells(i, 10)*eells(i,6)) + eells(i,7)) ^ (1/3) 
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' ((J*F) 


' ((J*F)+G)) '^(1/3) 

Next i 

Call CopyEntropyLambdaChart(startRange, numRows, currentName, True) 

'betaPereentage = fmdBestBeta(eurrentName, numRows - 3) not used but ean to determine 
'the optimal beta pereentage— not fully tested.. 

Sheets(eurrentName).Cells(2, 10) = betaPereentage 

With Charts("Entropy Lambda Chart").SeriesColleetion(l).Trendlines(l) 

'put trendline equation onto souree 
.DisplayEquation = True 
.DisplayRSquared = True 

Sheets(eurrentName).Cells(l, 8) = .DataLabel.Text 
End With 

End Sub 'entropyLambda 


Funetion: fmdBestBeta 
Author: Matt Behnke 
Created: 5/9/2002 

Deseription: Finds the best beta value by trying values from .01 to .5 
the results are stored in an array. 

inputs: 

Outputs: none 


Funetion findBestBeta(ByVal soureeSheet As String, ByVal biggestTimeStepValue As Integer) 


Dim bestR As Double 
Dim eurrentR As Double 
Dim eurrentLargestValue As Double 
Dim bestLargestValue As Double 
Dim theBestPereentage As Double 


'.1 % to 30% 

For i = 1 To 300 

'ehange the beta pereentage 

Sheets(soureeSheet).Cells(2, 10) = i /1000 


'get the new trenline equations 

With Charts("Entropy Lambda Chart").SeriesColleetion(l).Trendlines(l) 
'put trendline equation onto souree 
.DisplayEquation = True 
.DisplayRSquared = True 

Sheets(soureeSheet).Cells(l, 8) = .DataLabel.Text 
End With 
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trendEq_lambda = Sheets(sourceSheet).Cells(l, 8) 
mlambda = firstPartTrendEq(trendEq_lambda) 
blambda = secondPartTrendEq(trendEq_lambda) 

'get the eurrent values 

currentR = rSquaredTrendEq(trendEq_lambda) 

currentLargestValue = m lambda * biggestTimeStepValue ^ b lambda 

'compare the values 
If currentLargestValue <1# Then 
If currentR > bestR Then 
bestR = currentR 
theBestPercentage = i /1000 
End If 
Else 

End If 

Next i 

findBestBeta = theBestPercentage 
End Function 


Subroutine: Correlate S Lambda 
Author: Matt Behnke 
Created: 5/19/02 

Description: 1) Creates a sheet called "entropy lambda" where lambda and the Lyapunov 
number is calculated based on formulas given in 
inputs: sourceSheet.. uses the entropy lambda sheet created from EntropyLambda(). 

Outputs: none 


Sub Correlate_S_Lambda(ByVal sourceSheet As String) 

Dim sourceSheet2 As String 'holds the name of the affiliation summary sheet 

sourceSheet2 = "AffiliationSummary" 


numRows = CountRows(sourceSheet, 1) 'get the num of rows in the source sheet 
startRange = 4 

'create new sheet 
Sheets.Add 

currentName = ActiveSheet.Name 
Sheets("" & currentName).Name = 
currentName = ActiveSheet.Name 

'set height of header row and standard column width 
Rows("3:3").RowHeight = 52 '(R12.6) 

ActiveSheet.StandardWidth = 12 '(R12.5) 

'headers '(R6.3) 

Active Sheet.Cells(l, 1) = "m s" 

ActiveSheet.Cells(2, 1) = "b_s" 


'(R6.1) 

("CorrelateSLambda") 


290 






ActiveSheet.Cells(l, 3) = "m lambda" 

ActiveSheet.Cells(2, 3) = "b lambda" 

ActiveSheet.Cells(3, 1) = "TimeStep (k)" 

ActiveSheet.Cells(3, 2) = "month/year" 

ActiveSheet.Cells(3, 3) = "Entropy (S)" 

ActiveSheet.Cells(3, 4) = "Lambda" 

ActiveSheet.Cells(3, 5) = "Entropy(Lambda_k)" 

ActiveSheet.Cells(3, 6) = "delta lambda" 

ActiveSheet.Cells(3, 7) = "Lambda from beta goal" 

ActiveSheet.Cells(3, 8) = "Beta goal to min R^2" 

ActiveSheet.Cells(3, 9) = "Mindshare" 

ActiveSheet.Cells(2, 9) = "beta / authors" 

'get formula of frendlineEq of entropy power trend from the entropy lambda sheet 
frendEq_entropy = Sheets(sourceSheet).Cells(l, 7) 
ms = firstPartTrendEq(frendEq_entropy) 
b_s = secondPartTrendEq(frendEq_entropy) 

Sheets(currentName).Cells(l, 2) = b_s 
Sheets(currentName).Cells(2, 2) = m s 

'get formula of frendlineEq of the messages graph from the entropy lamda sheet 
trendEq_lambda = Sheets(sourceSheet).Cells(l, 8) 
mlambda = firstPartTrendEq(trendEq_lambda) 
blambda = secondPartTrendEq(frendEq_lambda) 

Sheets(currentName).Cells(l, 4) = b lambda 
Sheets(currentName).Cells(2, 4) = m lambda 

Sheets (currentN ame). S elect 

'fill in columns 

For i = startRange + 1 To numRows 
Sheets(currentName).Cells(i, 1) = Sheets(sourceSheet).Cells(i, 1) 
Sheets(currentName).Cells(i, 2) = Sheets(sourceSheet).Cells(i, 2) 
Sheets(currentName).Cells(i, 3) = Sheets(sourceSheet).Cells(i, 4) 
Sheets(currentName).Cells(i, 4) = Sheets(sourceSheet).Cells(i, 9) 
Sheets(currentName).Cells(i, 5) = "=$B$2*(G" & i & "/$D2)^(B1/D1)" 

' G ms 

' b_s * (-) ^ (-) 

' bLambda m lambda 


Sheets(currentName).Cells(i, 6) = "=(C" & i & "-E" & i & ")^2" 
Sheets(currentName).Cells(i, 7) = "=((H" & i & "* EntropyLambdalF" & i & ")/((Fl" 
& i & "* EntropyLambdalF" & i & ") + EntropyLambdaIG" & i & "))'''(l/3)" 

' ((H*F) 


' ((H*F)+G)) ^(1/3) 


Sheets(currentName).Cells(i, 8) = betaPercentage 

'goal seek the difference between actual entropy and calculated., so delta 

'approaches zero. 

RangeC'F" & i).GoalSeek Goal-Q, ChangingCell-Range("H" & i) 

If Sheets(currentName).Cells(i, 8) > 1# Or Sheets(currentName).Cells(i, 8) < 0# Then 
Sheets(currentName).Cells(i, 8) = betaPercentage 
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RangeC'F" & i).GoalSeek Goal-0, ChangingCell~Range("H" & i) 

End If 

If Slieets(currentName).Cells(i, 8) > 1# Or Slieets(currentName).Cells(i, 8) < 0# Then 
Sheets(currentName).Cells(i, 8) = betaPercentage 
End If 

'mindshare = beta / authors 

Sheets(currentName).Cells(i, 9) = "=(H" & i & 7" & sourceSheet2 & "!D" & i & ")" 
Next i 

Call CopyCale_beta_EntropyGraph(startRange, numRows, currentName) 

Call CopyMindshareGraph(startRange, numRows, currentName) 

End Sub 'correlate s e 


Sub: CopyMessagesGraph 
Author: Matt Behnke 
Created: 2/11/02 

Description: copies the graph of messages (Records, N) 
inputs: start range, num of rows on the source sheet 

Outputs: 


Sub CopyMessagesGraph(ByVal startRange As Integer, ByVal endRange As Integer, ByVal source As 
String) 

sheetCount = Sheets.Count 
Application. DisplayAlerts = False 

'GRAPH "Messages" 

Windo ws(" Entropy LambdaMacro. xls"). Activate 
Sheets("Messages").Select 

Sheets("Messages").Copy After:=Workbooks(currFilename).Sheets(sheetCount) 
ActiveChart.SeriesCollection(l). Select 
'msgs - Records, N 

ActiveChart.SeriesCollection(l).Values = & source & "!R" & startRange & "C3:R" & endRange & 

"C3" 

'timesteps 

ActiveChart.SeriesCollection(l).XValues = & source & "!R" & startRange & "C1:R" & endRange & 

"Cl" 

With ActiveChart. SeriesCollection( 1 ).Trendlines( 1) 

'put trendline equation onto source 
.DisplayEquation = True 
.DisplayRSquared = Tme 

Worksheets(source).Cells(l, 6).Value = .DataLabel.Text 
End With 

End Sub 'copy messages graph 
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Sub: CopyMapEntropyKK l 
Author: Matt Behnke 
Created: 2/13/02 

Deseription: eopies the graph of the map of entropy of timestep k& k+1 
inputs: start range, num of rows on the souree sheet 

Outputs: 


Sub CopyMapEntropyKK_l(ByVal startRange As Integer, ByVal endRange As Integer, ByVal souree As 
String) 

sheetCount = Sheets.Count 
Applieation. Display Alerts = False 
'GRAPH "Messages" 

Windows(" EntropyLambdaMaero. xls"). Aetivate 
Sheets("Map Entropy K, K+l").Seleet 

Sheets("Map Entropy K, K+1").Copy After:=Workbooks(eurrFilename).Sheets(sheetCount) 

AetiveChart.SeriesColleetion(l).Seleet 

'entropy k,k+l 

AetiveChart.SeriesColleetion(l).Values = & souree & "!R" & startRange & "C5:R" & endRange - 1 

& "C5" 

'entropy k 

AetiveChart.SeriesColleetion(l).XValues = & souree & "!R" & startRange & "C4:R" & endRange - 

1 & "C4" 

With AetiveChart. SeriesColleetion( 1 ).Trendlines( 1) 

'put trendline equation onto souree 
.DisplayEquation = True 
.DisplayRSquared = True 

'Worksheets(souree).Cells(l, 6).Value = .DataLabel.Text this trendline eq is not used. 

End With 

End Sub 'eopy map k k+1 graph 


Sub: CopyEntropyLambdaChart 
Author: Matt Behnke 
Created: 2/14/02 

Deseription: eopies the graph of the entropy lambda ehart 
inputs: start range, num of rows on the souree sheet 

update - if true it updates the trendline for series 1.. 

this is beeause the graph is added to the worksheet 
before the lambda data is there. 

Outputs: 


Sub CopyEntropyLambdaChart(ByVal startRange As Integer, ByVal endRange As Integer, ByVal souree 
As String, ByVal update As Boolean) 

If update = False Then 

sheetCount = Sheets.Count 
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Application. Display Alerts = False 


'GRAPH "Messages" 

Windo ws(" Entropy LambdaMacro. xls"). Act ivate 
SheetsC'Entropy Lambda Chart").Select 

Sheets("Entropy Lambda Chart").Copy After~Workbooks(currFilename).Sheets(sheetCount) 

ActiveChart.SeriesCollection(l). Select 

'lambda 

ActiveChart.SeriesCollection(l).Values = & source & "!R" & startRange & "C9:R" & endRange & 

"C9" 

'entropy S(k) 

ActiveChart.SeriesCollection(2).Values = & source & "!R" & startRange & "C4:R" & endRange & 

"C4" 

With ActiveChart.SeriesCollection(2).Trendlines(l) 

'put trendline equation onto source 
.DisplayEquation = True 
.DisplayRSquared = True 

Worksheets(source).Cells(l, 7).Value = .DataLabel.Text 'this trendline eq is not used. 

End With 

Else ' If update = True Then 

W ith ActiveChart. SeriesCollection( 1) .T rendlines( 1) 

'put trendline equation onto source 
.DisplayEquation = True 
.DisplayRSquared = True 

Worksheets(source).Cells(l, 8).Value = .DataLabel.Text 
End With 
End If 'update 

End Sub 'Copy Entropy Lambda Chart 


Sub: CopyCalc beta EntropyGraph 
Author: Matt Behnke 
Created: 2/14/02 

Description: copies the graph of the entropy lambda calculated chart 
inputs: start range, num of rows on the source sheet 

before the lambda data is there. 

Outputs: 


Sub CopyCalc_beta_EntropyGraph(ByVal startRange As Integer, ByVal endRange As Integer, ByVal 
source As String) 

sheetCount = Sheets.Count 

Application. Display Alerts = False 

Windo ws(" EntropyLambdaMacro. xls"). Activate 
Sheets("Calc_beta_Entropy (lambda) Chrt").Select 

Sheets("Calc_beta_Entropy (lambda) Chrt").Copy After:=Workbooks(currFilename).Sheets(sheetCount) 
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ActiveChart.SeriesCollection(l). Select 
'entropy S, actual 

ActiveCliart.SeriesCollection(l).Values = & source & "!R" & startRange & "C3:R" & endRange & 

"C3" 

ActiveCliart.SeriesCollection(l).XValues = & source & "!R" & startRange & "C1:R" & endRange & 

"Cl" 

'entropy S as a function of lambda 

ActiveChart.SeriesCollection(2).Values = & source & "!R" & startRange & "C5:R" & endRange & 

"C5" 

ActiveCliart.SeriesCollection(l).XValues = & source & "!R" & startRange & "C1:R" & endRange & 

"Cl" 

'beta_k_s(actual).. from the goal seek to minize the diff between the above two. 
ActiveChart.SeriesCollection(3).Values = & source & "!R" & startRange & "C8:R" & endRange & 

"C8" 

ActiveChart.SeriesCollection(l).XValues = & source & "!R" & startRange & "C1:R" & endRange & 

"Cl" 

End Sub 'Copy Entropy Lambda Chart 


Sub: CopyCalc beta EntropyGraph 
Author: Matt Behnke 
Created: 5/7/02 

Description: copies the graph of mindshare 
inputs: start range, num of rows on the source sheet 
Outputs: 


Sub CopyMindshareGraph(ByVal startRange As Integer, ByVal endRange As Integer, ByVal source As 
String) 

Dim sourceSheet2 As String 'holds the name of the affiliation summary sheet 

sourceSheet2 = "AffiliationSummary" 

sheetCount = Sheets.Count 

Application. DisplayAlerts = False 

Windows(" EntropyLambdaMacro. xls"). Activate 
Sheets("Mindshare"). Select 

Sheets("Mindshare").Copy After:=Workbooks(currFilename).Sheets(sheetCount) 
ActiveChart.SeriesCollection(l). Select 
'author instances 

ActiveChart.SeriesCollection(2).Values = & sourceSheet2 & "!R" & startRange & "C4:R" & 

endRange & "C4" 

ActiveChart.SeriesCollection(l).XValues = & source & "!R" & startRange & "C1:R" & endRange & 

"Cl" 

'beta k per author.. 

ActiveChart.SeriesCollection(l).Values = & source & "!R" & startRange & "C9:R" & endRange & 

"C9" 

ActiveChart.SeriesCollection(l).XValues = & sourceSheet2 & "!R" & startRange & "C1:R" & 

endRange & "Cl" 
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End Sub 'Copy mindshare graph 


Function: rSquaredTrendEq 
Author: Matt Behnke 
Created: 1/3/02 

Description: extracts the rSquared value of a trendline equation 

inputs: trendline equation 

Outputs: firstpart of trendline equation 


Function rSquaredTrendEq(ByVal trendlineEq As String) As Double 

tempStorage = Sheets(l).Cells(l, 1) 

Sheets(l).Cells(l, 1) = trendlineEq 

i = 1 

While found = False 

testchar = Sheets(l).Cells(l, l).Characters(i, l).Text 
If testchar = "R" Then 
found = True 
Else 
i = i+ 1 
End If 
Wend 

'i = location of R 

'secondpart starts at character i plus 5.. 

'num of characters = location(x) - 5 
'extract 5 characters.. 

rSquaredTrendEq = Sheets(l).Cells(l, l).Characters(i + 5, 6).Text 
Sheets(l).Cells(l, 1) = tempStorage 

End Function' rSquaredTrendEqu 


Function: CountRows 

Author: ? Revised by: Matt Behnke 

Created: ? 

Revised: 9/10/01 

Description: Counts the rows in the suppiled worksheet and column number 
inputs: sheetName - name of the sheet to count the rows in 
colNum - number of the column to count rows in 
Outputs: number of rows as a double 


Function CountRows(ByVal sheetname As String, ByVal colNum As Integer) As Double 

On Error Resume Next 

Dim currCell As Range, rowNum As Double 

SheetsC" & sheetname).Select 

If IsNumeric(colNum) Then 
Else 

colNum = 1 
End If 

rowNum = 1 
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Set currCell = Active Sheet. Cells(rowNum, colNum) 
Do While currCell.Value <> "" 
rowNum = rowNum + 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 
Loop 

CountRows = rowNum - 1 
End Function 'CountRows 


Function: CountCols 

Author: ? Revised by: Matt Behnke 

Created: ? 

Revised: 9/10/01 

Description: Counts the rows in the suppiled worksheet and column number 
inputs: sheetName - name of the sheet to count the columns in 
rowNum - number of the row to count columns in 
Outputs: number of columns as a double 


Function CountCols(ByVal sheetname As String, ByVal rowNum As Integer) As Integer 

On Error Resume Next 

Dim currCell As Range, colNum As Integer 

Sheets("" & sheetname).Select 

If IsNumeric(rowNum) Then 
Else 

rowNum = 1 
End If 
colNum = I 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 

Do While currCell.Value <> 
colNum = colNum + 1 

Set currCell = ActiveSheet.Cells(rowNum, colNum) 

Loop 

CountCols = colNum - 1 
End Function 'CountCols 


Function: lirstPartTrendEq 
Author: Matt Behnke 
Created: 11/13/01 

Description: extracts the first part of the given trendline equation 

inputs: trendline equation 

Outputs: firstpart of trendline equation 


Function firstPartTrendEq(ByVal trendlineEq As String) As Double 

tempStorage = Sheets(l).Cells(l, 1) 

Sheets(l).Cells(l, 1) = trendlineEq 

i = 1 

While found = False 

testchar = Sheets(l).Cells(l, l).Characters(i, l).Text 
If testchar = "x" Then 
found = True 
Else 
i = i+ 1 
End If 
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Wend 


'i = location of x 

'firstpart = starts at character 5 

'num of characters = location(x) - 5 

firstPartTrendEq = Sheets(l).Cells(l, l).Characters(5, i - 5).Text 
Sheets(l).Cells(l, 1) = tempStorage 

End Function ' first part trendline 


Function: secondPartTrendEq 
Author: Matt Behnke 
Created: 11/13/01 

Description: extracts the first part of the given trendline equation 

inputs: trendline equation 

Outputs: firstpart of trendline equation 


Function secondPartTrendEq(ByVal trendlineEq As String) As Double 

tempStorage = Sheets(l).Cells(l, 1) 

Sheets(l).Cells(l, 1) = trendlineEq 

i = 1 

While found = False 

testchar= Sheets(l).Cells(l, l).Characters(i, l).Text 
If testchar = "x" Then 
found = True 
Else 
i = i+ 1 
End If 
Wend 

'i = location of x 

'secondpart starts at character i plus 1.. 

'num of characters = location(x) - 5 
'extract 5 characters.. 

j 

While found2 = False 

testchar = Sheets(l).Cells(l, l).Characters(j, l).Text 
If testchar = "R" Then 
found2 = True 
Else 

j=j + l 

End If 
Wend 

stopchar = (j - I) - (i + I) 

secondPartTrendEq = Sheets(I).CeIIs(I, I).Characters(i + I, stopchar).Text 
Sheets(I).CeIIs(I, I) = tempStorage 

End Function ' secondPart eq 
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APPENDIX P - Q LEVEL ANALYSIS MACRO CODE 


Macro: Q Level Analysis 
Author: Matt Behnke 
Created: 1/18/02 

Description: Contains the qlevel analysis functions - code is needed from 

the affdiation distribution macro. Will need to separate this dependency. 

inputs: 

Outputs: 


sub: qLevelTrigger 
Author: Matt Behnke 
Created: 1/18/02 

Description: activates the qlevel functions - some code maybe needed from 
the affiliation distribution macro 

inputs: 

Outputs: 


Sub qLevelTriggerQ 

Dim graphStart As Integer 

qLevelType = InputBox("Enter Author or Term:") 
numQLevels = InputBox("Enter number of qLevels:") 

If qLevelType = "Author" Or qLevelType = "Term" And numQLevels > 0 Then 'check input 

If qLevelType = "Author" Then 
prefix = 

Else 

prefix ="" 

End If 

Call qLevelCumulative(qLevelType, prefix, numQLevels) 

Call qLevelSummary(qLevelType, prefix, numQLevels) 

Call qLevelYears(qLevelType, prefix, numQLevels) 

Call qLevelCopyYearsGraphs(numQLevels) 

Call qLevelMonths(qLevelType, prefix, numQLevels, False) 

Call qLevelMonths(qLevelType, prefix, numQLevels, True) 'calculate mass 
Call qLevelMonthsCount(qLevelType, prefix, numQLevels) 

Call qLevelEntropy(qLevelType, prefix, numQLevels) 

Call qLevelMonthTemp(numQLevels) 

Call qLevelEntropy2(qLevelType, prefix, numQLevels) 

If qLevelType = "Term" Then 

graphStart = qLevelGraphStart("q_level_monthly_temp", 3, 4) 

Call qLevelCopyGraphs(graphStart, numQLevels) 

End If 

Else 
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MsgBox ("WRONG INPUT.. TRY AGAIN!") 
End If 

End Sub 


sub: qLevelCumulative 
Author: Matt Behnke 
Created: I/I6/02 

Deseription: . .puts in eumulative values for eaeh time step, and sums at the bottom 
inputs: qlevelType: author or term 

prefix: the sheet prefix, ehanges whether its author or term 
uumqLevels: the number of q levels 
Outputs: 


Sub qLevelCumulative(ByVal qLevelType As String, ByVal prefix As String, ByVal uumQLevels As 
Integer) 

For i = 1 To uumQLevels 
If i < 10 Then 

Call CaleCumulative(prefix & "0" & i & & qLevelType & " month") 

Else 

Call CaleCumulative(prefix & "" & i & & qLevelType & " month") 

End If 
Next i 

End Sub 


sub: qLevelSummary 
Author: Matt Behnke 
Created: 1/14/02 

Deseription: ereates a summary sheet for a q_level, lists time steps and num of instanees 
for eaeh q level 

inputs: qlevelType: author or term 

prefix: the sheet prefix, ehanges whether its author or term 
uumqLevels: the number of q levels 
Outputs: 


Sub qLevelSummary(ByVal qLevelType As String, ByVal prefix As String, ByVal uumQLevels As 
Integer) 

For z = 1 To uumQLevels 
If z < 10 Then 

sheetName = prefix & "0" & z & & qLevelType & " month" 

summarySheet = prefix & "0" & z & & qLevelType & " summary" 

Else 

sheetName = prefix & "" & z & & qLevelType & " month" 

summarySheet = prefix & "" & z & & qLevelType & " summary" 

End If 

Sheets .Add After:=Sheets(Sheets.Count) 

'Sheets(Sheets.Count). Seleet 
AetiveSheet.Name = summarySheet 
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numColumns = CountCols(sheetName, 1) 
numRows = CountRows(sheetName, 1) 

Sheets(summarySheet).Cells(l, 1) = "q level" 
Slieets(summarySlieet).Cells(l, 2) = z 


Slieets(summarySlieet).Cells(2, 1) = " " 

Slieets(summarySlieet).Cells(3, 1) = "Time Steps" 

Sheets(summarySheet).Cells(3, 2) = "sum" 

Slieets(summarySlieet).Cells(3, 3) = "count" 

For i = 4 To numColumns 

Sheets(summarySheet).Cells(i, 1) = Sheets(sheetName).Cells(l, i) 

Sheets(summarySheet).Cells(i, 2) = "=SUM("' & sheetName & & col(i) & "2:" & col(i) & 

numRows & ")" 

Sheets(summarySheet).Cells(i, 3) = "=Count("' & sheetName & & col(i) & "2:" & col(i) & 

numRows & ")" 

Next i 
Next z 

End Sub 


sub: qLevelYears 
Author: Matt Behnke 
Created: 1/16/02 

Description: uses an array of years to store the amount of instances for a year. 

outputs each year of the dataset to each summary sheet and number of instances 
inputs: qlevelType: author or term 

prefix: the sheet prefix, changes whether its author or term 
numqLevels: the number of q levels 
Outputs: 


Sub qLevelYears(ByVal qLevelType As String, ByVal prefix As String, ByVal numQLevels As Integer) 

'number of ntuples 
nTuples = numQLevels 

Dim years As Variant 

overallQSummary = "q_summary_year" 

firstYear = "2500" 
firstYearOffset = 6 
lastY ear = 0 

Sheets. Add After:=Sheets(Sheets.Count) 

'Sheets(Sheets.Count). Select 
ActiveSheet.Name = overallQSummary 

Cells(l, 1) = "" 

Cells(2, 1) = "" 

Cells(3, 1) = "" 


301 






For z = 1 To nTuples 


years = Array(0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0) 


If z < 10 Then 

summarySheet = prefix & "0" & z & & qLevelType & " summary" 

Else 

summarySheet = prefix & "" & z & & qLevelType & " summary" 

End If 


numRows = CountRows(summary Sheet, 1) 

For i = 4 To numRows 
'determine eurrent year 

j = 1 

While found = False 

testehar = Cells(i, l).Charaeters(j, l).Text 
If testehar = "/" Then 
found = Trae 
Else 

j=j + l 

End If 
Wend 

'month ends at j 

eurrentMonth = Cells(i, l).Charaeters(l, j - l).Text 
If eurrentMonth <10 Then 

the Year = Cells(i, l).Charaeters(5, 4).Text 
Else 

theYear = Cells(i, l).Charaeters(6, 4).Text 
End If 


'If theYear > 50 Then 'add prefix to the year 

' theYear = "19" & theYear 

'Else 

' theYear = "20" & theYear 
'End If 


'eheek to see if the eurrentYear is less than first year 
If theYear < first Year Then 
first Year = theYear 

firstYearOffset = firstYearOffset - 1 'array index 
End If 


If theYear > lastYear Then 
last Year = theYear 
End If 

yearOffset = theYear - firstYear + 5 'so there ean be 5 years less data than the first firstYear value 
years(yearOffset) = Sheets(summarySheet).Cells(i, 2) 
found = False 
Next i 


'output yearsArray 
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Sheets(summarySheet).Cells(3, 4) = "Years" 
Sheets(summarySheet).Cells(3, 5) = "instances' 

counter = 4 'for output 


For X = firstYearOffset To lastYear - firstYear + firstYearOffset 

Sheets(summarySheet).Cells(counter, 4) = firstYear + x - firstYearOffset 
Sheets(summarySheet).Cells(counter, 5) = years(x) 'instances 
If z = 1 Then 

Sheets(overallQSummary).Cells(counter, z) = firstYear + x - firstYearOffset 
Sheets(overallQSummary).Cells(counter, z + 1) = 0 
End If 

Sheets(overaIIQSummary).Cells(counter, z + 2) = years(x) 

'if there is a zero in a year then the put the previous years value into the current year (cumulative) 

If Sheets(overallQSummary).Cells(counter, z + 2) = 0 And x > firstYearOffset Then 

Sheets(overallQSummary).Cells(counter, z + 2) = Sheets(overallQSummary).Cells(counter - 1, z + 

End If 

If X = firstYearOffset Then 

Sheets(overallQSummary).Cells(counter - 1, z + 2) = z 
End If 


counter = counter + 1 

Next X 

Next z 
'copy chart 

'currFilename = Application. ActiveWorkbook.Name 
'Windows("AffiliationMacro.xls"). Activate 
'Sheets("q_level_yr").Select 

'Sheets("q_level_yr").Copy After~Workbooks(currFilename).Sheets(l) 

I 

'counter = 4 
'columnStart = 2 
'For z = 2 To nTuples 

'ActiveChart.SeriesCollection(z - l).Values = & overallQSummary & "!R" & counter & "C" & 

columnStart & ":R" & counter & "C" & ntuples 
' counter = counter + 1 
'Next z 

End Sub 


sub: qLevelMonths 

Author: Matt Behnke 

Created: 1/27/02-finished 2/4/02 

Description: uses an array of years and months to store the amount of instances for each timestep k. 

outputs the number of cumulative instances per month / year 
inputs: qlevelType: author or term 

prefix: the sheet prefix, changes whether its author or term 
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' numqLevels: the number of q levels 
' Outputs: 


Sub qLevelMonths(ByVal qLevelType As String, ByVal prefix As String, ByVal uumQLevels As Integer, 
ByVal mass As Boolean) 

'number of ntuples 
nTuples = numQLevels 

Dim years As Variant 
Dim yearsMonths As Variant 

first Year = "2500" 
lastY ear = 0 

If mass = True Then 

overallQSummary = "q_summary_monthly_count_mass" 

Else 

overallQSummary = "q_summary_monthly_instances" 

End If 

Sheets. Add After:=Sheets(Sheets.Count) 

'Sheets(Sheets.Count). Select 
ActiveSheet.Name = overallQSummary 

Sheets(overallQSummary).Cells(l, 1) = " " 

Sheets(overallQSummary).Cells(2, 1) = " " 

Sheets(overallQSummary).Cells(3, 1) = " " 

Cells(4, 2). Select 

ActiveWindow.FreezePanes = Trae 

years = Array() 
yearsMonths = Array() 

'ReDim yearsMonths(0 To 0, 1 To 12) 

For z = 1 To nTuples 

'get the source sheet's name 
If z < 10 Then 

summarySheet = prefix & "0" & z & & qLevelType & " summary" 

Else 

summarySheet = prefix & "" & z & & qLevelType & " summary" 

End If 

numRows = CountRows(summary Sheet, 1) 

'scan the first sheet to get the last and first year to get the array bounds 
If z = 1 Then 

For i = 4 To numRows 

'determine current year 

j = 1 

While found = False 

testchar = Cells(i, l).Characters(j, l).Text 
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If testchar = "/" Then 
found = True 
Else 

j=j + l 

End If 
Wend 

'month ends at j 

eurrentMonth = Cells(i, l).Characters(l, j - l).Text 
If eurrentMonth <10 Then 

theYear = Cells(i, l).Characters(5, 4).Text 
Else 

theYear = Cells(i, l).Characters(6, 4).Text 
End If 

'If theYear > 50 Then 'add prefix to the year 

' theYear = "19" & theYear 

'Else 

' theYear = "20" & theYear 
'End If 

'check to see if the currentYear is less than first year 
If the Year < firstYear Then 
firstYear = theYear 
End If 

If theYear > lastYear Then 
lastYear = theYear 
End If 

found = False 

Next i 'done scanning the sheet now redim the array 
'ReDim yearsMonths(firstYear To lastYear, 1 To 12) 
End If'z= 1 

ReDim yearsMonths(firstYear To lastYear, 1 To 12) 

For i = 4 To numRows 'now process all the nTuple sheets 
'determine current year 

j = 1 

While found = False 

testchar = Cells(i, l).Characters(j, l).Text 
If testchar = "/" Then 
found = Trae 
Else 

j=j + l 

End If 
Wend 

'month ends at j 

eurrentMonth = Cells(i, l).Characters(l, j - l).Text 
If eurrentMonth <10 Then 

theYear = Cells(i, l).Characters(5, 4).Text 
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Else 

theYear = Cells(i, l).Characters(6, 4).Text 
End If 

'If theYear > 50 Then 'add prefix to the year 

' theYear = "19" & theYear 

'Else 

' theYear = "20" & theYear 
'End If 

yearsMonths(theYear, eurrentMonth) = Sheets(summarySheet).Cells(i, 2) 
found = False 
Next i 

'output yearsArray 

'Sheets(summarySheet).Cells(3, 4) = "Years" 
'Sheets(summarySheet).Cells(3, 5) = "instanees" 

eounter = 4 'for output 
monthCounter = 1 
lastValue = 0 


For j = firstYear To last Year 
Fork = 1 To 12 

If"" & j = firstYear And k = 1 Then 'label the q levels 
Sheets(overallQSummary).Cells(eounter - 1, z + 2) = z 
End If 

If z = 1 Then 'put the month/year 

Sheets(overallQSummary).Cells(eounter, z) = '"" & monthCounter & "/" & j 
Sheets(overallQSummary).Cells(eounter, z + 1) = 0 
End If 

eurrentValue = yearsMonths(j, monthCounter) 

If mass = True Then 
multiplyer = z 
Else 

multiplyer = 1 
End If 

If eurrentValue > lastValue Then 

Sheets(overallQSummary).Cells(eounter, z + 2) = eurrentValue * multiplyer 
lastValue = eurrentValue 
Else 

Sheets(overallQSummary).Cells(eounter, z + 2) = lastValue * multiplyer 
End If 

monthCounter = monthCounter + 1 
If monthCounter >12 Then 
monthCounter = 1 
End If 
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If z = nTuples Then 

Sheets(overallQSummary).Cells(counter, z + 3) = '-SUM(" & col(2) & counter _ 
& & col(nTuples + 2) & counter & ")" 

End If 

counter = counter + I 

Next k 
Next j 

'If z = nTuples Then 

' Sheets(overallQSummary).Cells(counter - 1, z + 3) = "=SUM(" * col(2) & counter - 1 
& & col(nTuples + 2) & counter - 1 & ")" 

' End If 


Next z 

'total of all the qlevels: 

Sheets(overallQSummary).Cells(l, 1) = "=SUM(" & col(2) & counter - 1 & & col(nTuples + 2) & 

counter - 1 & ")" 


'copy chart 

' currFilename = Application.ActiveWorkbook.Name 

' Windows("AffiliationMacro.xls").Activate 
' Sheets("q_level_yr").Select 

' Sheets("c|_level_yr").Copy After~Workbooks(currFilename).Sheets(Sheets.Count) 

' counter = 4 
' columnStart = 2 
' For z = 2 To nTuples 

' ActiveChart.SeriesCollection(z - l).Values = & overallQSummary & "!R" & counter & "C" & 

columnStart & ":R" & counter & "C" & nTuples 
' counter = counter + 1 
' Next z 

End Sub 'qlevel months 


sub: qLevelMonthsCOUNT 
Author: Matt Behnke 
Created: 2/17/02 

Description: uses an array of years and months to store the amount of instances for each timestep k. 

outputs the number of terms in the vocab per month / year puts 
inputs: qlevelType: author or term 

prefix: the sheet prefix, changes whether its author or term 
numqLevels: the number of q levels 

Outputs: 


Sub qLevelMonthsCount(ByVal qLevelType As String, ByVal prefix As String, ByVal uumQLevels As 
Integer) 

'number of ntuples 
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nTuples = numQLevels 


Dim years As Variant 
Dim yearsMonths As Variant 

first Year = "2500" 
lastY ear = 0 

overallQSummary = "c|_summary_monthly_eount_eount" 

Sheets. Add After~Sheets(Sheets.Count) 

'Sheets(Sheets.Count).Seleet 
AetiveSheet.Name = overallQSummary 

Sheets(overallQSummary).Cells(l, 1) = " " 
Sheets(overallQSummary).Cells(2, 1) = " " 
Sheets(overallQSummary).Cells(3, 1) = " " 

Cells(4, 2). Seleet 

AetiveWindow.FreezePanes = Tme 

years = Array() 
yearsMonths = Array() 

'ReDim yearsMonths(0 To 0, 1 To 12) 

For z = 1 To nTuples 

'get the souree sheet's name 
If z < 10 Then 

summarySheet = prefix & "0" & z & & qLevelType & " summary" 

Else 

summarySheet = prefix & "" & z & & qLevelType & " summary" 

End If 

numRows = CountRows(summary Sheet, 1) 

'sean the first sheet to get the last and first year to get the array bounds 
If z = 1 Then 

For i = 4 To numRows 

'determine eurrent year 

j = 1 

While found = False 

testehar = Cells(i, l).Charaeters(j, l).Text 
If testehar = "/" Then 
found = Tme 
Else 

j=j + l 

End If 
Wend 

'month ends at j 

eurrentMonth = Cells(i, l).Charaeters(l, j - l).Text 
If eurrentMonth <10 Then 

theYear = Cells(i, l).Charaeters(5, 4).Text 
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Else 

theYear = Cells(i, l).Characters(6, 4).Text 
End If 

'If theYear > 50 Then 'add prefix to the year 

' theYear = "19" & theYear 

'Else 

' theYear = "20" & theYear 
'End If 

'eheek to see if the eurrentYear is less than first year 
If the Year < firstYear Then 
firstYear = theYear 
End If 

If the Year > lastYear Then 
lastYear = theYear 
End If 

found = False 

Next i 'done seanning the sheet now redim the array 
'ReDim yearsMonths(firstYear To lastYear, 1 To 12) 

End If'z= 1 

ReDim yearsMonths(firstYear To lastYear, 1 To 12) 

For i = 4 To numRows 'now proeess all the nTuple sheets 
'determine eurrent year 

j = 1 

While found = False 

testehar = Cells(i, l).Charaeters(j, l).Text 
If testehar = "/" Then 
found = True 
Else 

j=j + l 

End If 
Wend 

'month ends at j 

eurrentMonth = Cells(i, l).Charaeters(l, j - l).Text 
If eurrentMonth <10 Then 

theYear = Cells(i, l).Charaeters(5, 4).Text 
Else 

theYear = Cells(i, l).Charaeters(6, 4).Text 
End If 

'If theYear > 50 Then 'add prefix to the year 

' theYear = "19" & theYear 

'Else 

' theYear = "20" & theYear 
'End If 

yearsMonths(theYear, eurrentMonth) = Sheets(summarySheet).Cells(i, 3) 'eount 
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found = False 
Next i 

eounter = 4 'for output 
monthCounter = 1 
lastValue = 0 

'numY ears = UBound(yearsMonthsa) 

For j = firstYear To lastYear 
Fork = 1 To 12 

If"" & j = firstYear And k = 1 Then 'label the q levels 
Sheets(overallQSummary).Cells(eounter - 1, z + 2) = z 
End If 

If z = I Then 'put the month/year 

Sheets(overallQSummary).CeIIs(eounter, z) = '"" & monthCounter & "/" & j 
Sheets(overallQSummary).CeIls(eounter, z + I) = 0 
End If 

eurrentValue = yearsMonths(j, monthCounter) 

If mass = True Then 
multiplyer = z 
Else 

multiplyer = 1 
End If 

If eurrentValue > lastValue Then 

Sheets(overallQSummary).Cells(eounter, z + 2) = eurrentValue * multiplyer 
lastValue = eurrentValue 
Else 

Sheets(overallQSummary).Cells(eounter, z + 2) = lastValue * multiplyer 
End If 

monthCounter = monthCounter + I 
If monthCounter >12 Then 
monthCounter = I 
End If 

If z = nTuples Then 

Sheets(overallQSummary).CeIIs(eounter, z + 3) = "=SUM(" & eol(2) & eounter 
& ":" & eol(nTupIes + 2) & eounter & ")" 

End If 


eounter = eounter + I 

Next k 
Next) 

' If z = nTuples Then 

' Sheets(overallQSummary).CeIls(eounter - I, z + 3) = "=SUM(" * eoI(2) & eounter - I 
' & & eoI(nTupIes + 2) & eounter - I & ")" 

' End If 
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Next z 

'total of all the qlevels: 

Sheets(overallQSummary).Cells(l, 1) = "=SUM(" & eol(2) & eounter - 1 & & eol(iiTuples + 2) & 

eounter - 1 & ")" 

End Sub 'qlevel months CoUNT 


sub: qLevelEntropy 
Author: Matt Behnke 
Created: 2/4/02 
finished: 3/11/02 

Deseription: uses an array of years and months to store the amount of entropy for each timestep k. 

outputs the cumulative entropy per timestep per q level 
inputs: qlevelType: author or term 

prefix: the sheet prefix, changes whether its author or term 
numqLevels: the number of q levels 
Outputs: 


Sub qLevelEntropy(ByVal qLevelType As String, ByVal prefix As String, ByVal uumQLevels As Integer) 

'number of ntuples 
uTuples = uumQLevels 

Dim years As Variant 

Dim yearsMonths As Variant 

Dim contributiouEntropy As Variant 

first Year = "2500" 
lastY ear = 0 

overallQEntropy = "q_local_entropy_monthly" 
contributiouEutropySheet = "q_contribution_entropy_monthly" 
countSheet = "q_summary_monthly_instances" 

'add the contribution entropy sheet 
Sheets. Add After:=Sheets(Sheets.Count) 

'Sheets(Sheets.Count). Select 
ActiveSheet.Name = contributionEntropySheet 
Cells(4, 2). Select 

ActiveWindow.FreezePanes = Tme 

'add the local entropy sheet 

Sheets. Add After:=Sheets(Sheets.Count) 

'Sheets(Sheets.Count). Select 
ActiveSheet.Name = overallQEntropy 
Cells(4, 2). Select 

ActiveWindow.FreezePanes = Tme 

years = Array() 
yearsMonths = Array() 
contributiouEntropy = Array() 
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'ReDim yearsMonths(0 To 0, 1 To 12) 

For z = 1 To nTuples 

'get the souree sheet's name 
If z < 10 Then 

summarySheet = prefix & "0" & z & & qLevelType & " month" 

Else 

summarySheet = prefix & "" & z & & qLevelType & " month" 

End If 

numRows = CountRows(summary Sheet, 1) 
numCols = CountCols(summary Sheet, 1) 

'find the row that eontains the time eounts for the eurrent time step... 

'sean the first sheet to get the last and first year to get the array bounds 
If z = 1 Then 

For i = 4 To numCols 

'determine eurrent year 

j = 1 

While found = False 

testehar = Cells(l, i).Charaeters(j, l).Text 
If testehar = "/" Then 
found = True 
Else 

j=j + l 

End If 
Wend 

'month ends at j 

eurrentMonth = Cells(l, i).Charaeters(l, j - l).Text 
If eurrentMonth <10 Then 

theYear = Cells(l, i).Charaeters(5, 4).Text 
Else 

theYear = Cells(l, i).Charaeters(6, 4).Text 
End If 

' If theYear > 50 Then 'add prefix to the year 
' theYear = "19" & theYear 
' Else 

' theYear = "20" & theYear 
' End If 

'eheek to see if the eurrentYear is less than first year 
If theYear < first Year Then 
firstYear = theYear 
End If 

If theYear > lastYear Then 
lastYear = theYear 
End If 

found = False 
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Next i 'done scanning the sheet now redim the array 
'ReDim yearsMonths(firstYear To lastYear, 1 To 12) 

End If'z= 1 

ReDim yearsMonths(firstYear To lastYear, 1 To 12) 

ReDim contributionEntropy(firstYear To lastYear, 1 To 12) 

For i = 4 To numCols 'now process all the nTuple sheets 

'determine current year 

j = 1 

While found = False 

testchar= Sheets(summarySheet).Cells(l, i).Characters(j, l).Text 
If testchar = "/" Then 
found = True 
Else 

j + 1 

'found = True 
End If 
Wend 

'month ends at j 

currentMonth = Sheets(summarySheet).Cells(l, i).Characters(l, j - l).Text 
If currentMonth <10 Then 

theYear = Sheets(summarySheet).Cells(l, i).Characters(5, 4).Text 
Else 

theYear = Sheets(summarySheet).Cells(l, i).Characters(6, 4).Text 
End If 

'If theYear > 50 Then 'add prefix to the year 

' theYear = "19" & theYear 

'Else 

' theYear = "20" & theYear 
'End If 

time Step = "" & currentMonth & "/" & theYear 
timeStepRow = findStringInSheet(countSheet, timeStep, "A") 

' temp = Sheets(summarySheet).Cells(l, 1) 

' Sheets(summarySheet).Cells(l, 1) = timeStepRange 
' timeStepRow = Sheets(summarySheet).Cells(l, l).Characters(4, 5).Text 
' Sheets(summarySheet).Cells(l, l) = temp 

totallnstances = Sheets(countSheet).Cells(timeStepRow, uTuples + 3) '0_Q_k 
localSumlnstances = Sheets(countSheet).Cells(timeStepRow, z + 2) '0_CLk 


' Sh(A)_k_q = num instances A at k in q_level num instances A at k in c|_lvl 

' - * [Qg2 - 

' sum of instances at k in q_lvl sum of instances at k in q_lvl 
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Sh(k)_q = Sh(A)_k + Sh(B)_k + ... 


' [ sum instances at k in c|_lvl (0_q_k) ] 

'contribution Cs qlevel k = abs [- ] 

' [ sum instances at all Q lvls (0_Q_k) ] 


0_q_k 0_Q_k 

+ -* log2- 


0_Q_k 0_q_k 


* Sh(k)_q 


'traverse all the rows in the summarySheet to get the num of instances of each term 
'and the entropies of each term 

'store the sum of the entropies of each term in step k in the yearsMonths array. 

For j = 2 To numRows 

If Sheets(summarySheet).Cells(j, i) > 0 Then 
theValue = Sheets(summarySheet).Cells(j, i) 

entropy = (-theValue / localSumlnstances) * (Log(theValue / localSumlnstances) / Log(2)) 

yearsMonths(theYear, currentMonth) = yearsMonths(theYear, currentMonth) -I- entropy 
End If 

'at the last term in the time step compute the contribution entropy 
If j = numRows Then 

contributionEntropy(theYear, currentMonth) = Abs(localSumInstances / totallnstances) _ 

* yearsMonths(theYear, currentMonth) -I- ((localSumlnstances / totallnstances) 

* (Log(totalInstances / localSumlnstances) / Log(2))) 

End If 
Next) 

found = False 
Next i 

'output yearsArray 

'Sheets(summarySheet).Cells(3, 4) = "Years" 

'Sheets(summarySheet).Cells(3, 5) = "instances" 

counter = 4 'for output 
monthCounter = 1 
lastValue = 0 
lastContributionValue = 0 

For j = firstYear To lastYear 

Fork = 1 To 12 

If& j = firstYear And k = 1 Then 'label the q level 
Sheets(overallQEntropy).Cells(counter - 1, z -l- 2) = z 
End If 

If z = 1 Then 'put the month/year 

Sheets(overallQEntropy).Cells(counter, z) = & monthCounter & "/" & j 

Sheets(overallQEntropy).Cells(counter, 7 .+ 1) = 0 
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Sheets("q_coiitribution_entropy_monthly").Cells(counter, z) = & monthCounter & "/" & j 

Sheets("q_coiitribution_entropy_moiithly").Cells(couiiter, z + 1) = 0 
End If 

currentValue = yearsMontbs(j, montbCounter) 
currentContributionValue = contributionEntropy(j, montbCounter) 

If currentValue > 0 Or Not currentValue = Null Then 

Sbeets(overallQEntropy).Cells(counter, z + 2) = currentValue 

lastValue = currentValue 
Else 

Sbeets(overallQEntropy).Cells(counter, z + 2) = lastValue 
End If 

If currentContributionValue > 0 Or Not currentContributionValue = Null Then 

Sbeets(contributionEnbopySbeet).Cells(counter, z + 2) = currentContributionValue 
lastContributionValue = currentContributionValue 
Else 

Sbeets(contributionEntropySbeet).Cells(counter, z + 2) = lastContributionValue 
End If 

montbCounter = montbCounter + 1 
If montbCounter >12 Then 
montbCounter = 1 
End If 

If z = uTuples Then 

Sbeets(overallQEnbopy).Cells(counter, z + 3) = '-SUM(" & col(2) & counter _ 

& & col(nTuples + 2) & counter & ")" 

End If 

If z = uTuples Then 

'S(q) 

Sbeets(contributionEntropySbeet).Cells(counter, z + 3) = '-SUM(" & col(2) & counter _ 

& & col(nTuples + 2) & counter & ")" 

Sbeets(contributionEnbopySbeet).Cells(counter, z + 5) = _ 
Sbeets(contributionEnbopySbeet).Cells(counter, z + 3).Value 

'n(q) 

Sbeets(contributionEnbopySbeet).Cells(counter, z + 4) = _ 
Sbeets("q_summary_montbly_instances").Cells(counter, z + 3) 

Sbeets(contributionEnbopySbeet).Cells(counter, z + 6) = _ 
Sbeets(contributionEnbopySbeet).Cells(counter, z + 4).Value 

If counter > 4 Then 'calc tbe delta values and temp 
'delta s 

Sbeets(contributionEnbopySbeet).Cells(counter, z + 7) = _ 

& col(z + 5) & counter & & col(z + 5) & counter - 1 

'delta n 

Sbeets(contributionEntropySbeet).Cells(counter, z + 8) = _ 

& col(z + 6) & counter & & col(z + 6) & counter - 1 
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'temp delta n / delta s 

Slieets(contributionEiitropySlieet).Cells(couiiter, z + 9) = 
& col(z + 8) & counter & "/" & col(z + 7) & counter 

End If 

End If 'z = nTuples 


counter = counter + 1 


Next k 
Next j 

Next z 

'sheet header: 

Sheets(contributionEntropySheet).Cells(2, nTuples + 3) = "S(q)" 
Sheets(contributionEntropySheet).Cells(2, nTuples + 4) = "n(q)" 
Sheets(contributionEnbopySheet).Cells(l, nTuples + 5) = "After linear interpolation" 
Sheets(contributionEnbopySheet).Cells(2, nTuples + 5) = "S(q)" 
Sheets(contributionEnbopySheet).Cells(2, nTuples + 6) = "n(q)" 
Sheets(contributionEnbopySheet).Cells(2, nTuples + 7) = "delta S" 
Sheets(contributionEnbopySheet).Cells(2, nTuples + 8) = "delta n" 
Sheets(contributionEntropySheet).Cells(2, nTuples + 9) = "temp (n/s)" 


'run linear interpolation on S(q) and n(q) 

Call FillIn]VIissingData(contributionEnbopySheet, nTuples + 5, 4, counter - 1) 
Call FillIn]VIissingData(contributionEnbopySheet, nTuples + 6, 4, counter - 1) 

End Sub' qlevelEntropy 


sub: qLevelMonthTemp 
Author: Matt Behnke 
Created: 2/28/02 

Description: uses the counts from the q level month sheet to: 

1) store the instances of each time step into an array 

2) calculates the probabilities of each instance 

3) determines the x & y values 

4) determines the alpha, beta values from the curvefit 

5) outputs timesteps, and alpha and beta onto a new sheet 
coeff(l) = A 

coeff(2) = B 
alpha = B 

beta = e''(-A/alpha) 

inputs: numqLevels - the number of qlevels nTuples.. 

Outputs: 


Sub qLevelMonthTemp(ByVal numQLevels As Integer) 

'number of ntuples 
nTuples = numQLevels 
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Dim instances_q(64) As Variant 
Dim probabilities(64) As Variant 
Dim x_values(64) As Variant 
Dim y_values(64) As Variant 
Dim coefficients As Variant 
Dim numRows As Integer 
Dim numlnstances k As Double 
Dim gamma As Double 
Dim numValues As Integer 


'stores the instances in each c|_level c|_i_k 
'stores the probabilities of instances P(q_i_k) 
'x values [X: ln(qi+r)] -weibull 
'y values [Y: ln[-ln(l-P(qi)] -weibull 
'coefficients 

'number of rows on the datasheet 
'total num of instances at step k 
'the shift, r 

'the number of values in the x,y arrays 


gamma = 0# 

'the source datasheet contains the timesteps and the count of instances in each q level per time step.. 
datasheet2 = "q_summary_monthly_instances" 
tempSheet = "q_level_monthly_temp" 

numRows = CountRows(datasheet2, 1) 
numValues = nTuples 

Sheets. Add After~Sheets(Sheets.Count) 

'Sheets(Sheets.Count). Select 
ActiveSheet.Name = tempSheet 

Cells(4, 2). Select 

ActiveWindow.FreezePanes = Trae 


Sheets(temp Sheet).Cells(l, 1) 
Sheets(tempSheet).Cells(2, 1) 
Sheets(temp Sheet).Cells(3, 1) 
Sheets(tempSheet).Cells(3, 2) 
Sheets(tempSheet).Cells(3, 3) 
Sheets(temp Sheet).Cells(3,4) 
Sheets(tempSheet).Cells(3, 5) 
Sheets(tempSheet).Cells(3, 6) 
Sheets(tempSheet).Cells(2, 6) 


= "k" 

= "interval" 

= "A" 

= "B" 

= "alpha = B" 

= "beta = e''(-A/alpha)" 
= "T = beta" 


'numlnstances k = Sheets(datasheet2).Cells(l, 1) 'THe total num of instances over the whole dataset 


For k = 4 To numRows 'traverse all the steps 

numlnstances k = Sheets(datasheet2).Cells(k, nTuples + 3) 'columns are offset by 2 
Sheets(tempSheet).Cells(k, 1) = k - 3 'timestep 

Sheets(tempSheet).Cells(k, 2) = Sheets(datasheet2).Cells(k, 1) 'interval 


num_q_at_k = 0 


For z = 1 To nTuples 'traverse all the nTuples 

instances_q(z) = Sheets(datasheet2).Cells(k, z -l- 2) 
probabilities(z) = 0 'reinit array 
If instances_q(z) > 0 Then 

probabihties(z) = instances_q(z) / numlnstances k 
'In = log(x) / log(exp(l)) 

x_values(z) = Log(instances_q(z) -I- gamma) / Log(Exp(l)) 'Ln(instances_q(z) -I- gamma) 
y_values(z) = Log((-Log(l - probabihties(z)) / Log(Exp(l)))) / Log(Exp(l)) 'Ln(-Ln(l - 
probabilities(z))) 
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num_q_at_k = num_q_at_k + 1 
'DEBUG********8 

Sheets(temp Sheet).Cells(3, 10) = "aetual probabilities" 
Sheets(tempSheet).Cells(k, z + 9) = probabilities(z) 


Else 

End If 
Next z 

If num_q_at_k > 0 Then 

eoeffleients = eurveFit(num_q_at_k, x values, y values, I) 


Sheets(tempSheet).CeIls(k, 3) = eoeffieients(I) 'A 
Sheets(tempSheet).Cells(k, 4) = eoeffieients(2) 'B 
Sheets(tempSheet).Cells(k, 5) = eoeffieients(2) 'alpha = B 
If Not eoeffieients(2) = 0 Then 

Sheets(tempSheet).Cells(k, 6) = Exp(-eoeffieients(l) / eoeffieients(2)) 'beta 
Else 

Sheets(tempSheet).Cells(k, 6) = 0 'beta 
End If 

End If 'q at k > 0 
Next k 

'debug::::::: 

For k = 4 To numRows 
For z = 1 To nTuples 

instanees_q(z) = Sheets(datasheet2).Cells(k, z + 2) 

If instanees_q(z) > 0 Then 

'DEBIJG************** 

» 

' P(cLi) ^ l-e'^(q_i/beta)''alpha 
» 

If Sheets(tempSheet).Cells(k, 6) > 0 Then 
Dim p_q_i_ealeed As Double 
Dim beta As Double 
Dim alpha As Double 
beta = Sheets(tempSheet).Cells(k, 6).Value 
alpha = Sheets(tempSheet).Cells(k, 5).Value 
'Sheets(tempSheet).Cells(l, 1) = instanees_q(z) 

'=(1 -EXP(B3/$M$7)^$L$7)*-1 

I 

'p_q_i_ealeed = 1 - (1 / (Exp(instanees_q(z) / beta) ^ alpha)) 

Sheets(tempSheet).Cells(3, 40) = "ealeulated probabilities" 

Sheets(tempSheet).Cells(k, z + 39) = "=(1-EXP(-" & instanees_q(z) & "/" & beta & ")^" & 

alpha & ")" 

End If 

End If 
Next z 
Next k 

End Sub 
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function: qLevelGraphStart 
Author: Matt Behnke 
Created: 4/26/02 

Description: Determines the starting position for the source data of the 
the charts: n(x), S(x), T(x) - Based on Entropy. 

the function checks a column to see when a value is not null and when 
the value is greater than zero 

only checks the 150 rows.. 

inputs: Sheetname - the name of the sheet the function checks. 
columnNumber - the column number that the function uses 
rowNumber - starting row 

Outputs: integer 


Function qLevelGraphStart(ByVal sheetName As String, ByVal columnNumber As Integer, ByVal 
rowNumber As Integer) 

Dim checkValue As String 

For i = rowNumber To rowNumber + 150 

checkValue = Sheets(sheetName).Cells(i, columnNumber) 

If Not checkValue = "" Then 
qLevelGraphStart = i 
Exit For 
End If 

Next i 

End Function 'qLevelGraphStart 


function: qLevelCopyGraphs 
Author: Matt Behnke 
Created: 4/26/02 

Description: Copies the graphs from the AffiliationMacro sheet 

inputs: GraphStart - the starting row of the source data.. 
numQLevels - the number of q levels 

Outputs: none 


Sub qLevelCopyGraphs(ByVal graphStart As Integer, ByVal numQLevels As Integer) 

Dim theFilename As String 
Dim macroFilename As String 

Dim copyAfter As String 'the sheet to copy the graphs after 

Dim lastRow As Integer 
Dim dataOffset As Integer 

dataOffset = numQLevels + 2 

theFilename = Application. ActiveWorkbook.Name 
macroFilename = "AffdiationMacro.xls" 
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copy After = "c|_level_monthly_temp" 
lastRow = CountRows(copyAfter, 1) 

Application.Display Alerts = False 

W indows(macroFilename) .Activate 
Slieets("T(X) - Weibull Curve Fit").Select 

Slieets("T(X) - Weibull Curve Fit").CopyAfler~Workbooks(tlieFilename).Slieets(copyAfter) 
ActiveChart.PlotArea. Select 

ActiveChart.SeriesCollection(l).Values = '-ci_level_monthly_temp!R" & graphStart & "C6:R" & 
lastRow & "C6" 

Windows("Affiliation]Vlacro.xls").Activate 
Sheets("S(q)"). Select 

Slieets("S(q)").Copy After~Workbooks(tlieFilename).Sheets(copyAfter) 

ActiveChart.PlotArea. Select 
ActiveChart.SeriesCollection(l). Values = _ 

'-q_contribution_entropy monthly!R" & graphStart & "C" & dataOffset + 3 & ":R" & lastRow & 
"C" & dataOffset + 3 

Windows("Affiliation]Vlacro.xls").Activate 
Sheets("n(q)"). Select 

Sheets("n(q)").Copy After~Workbooks(theFilename).Sheets(copyAfter) 

ActiveChart.PlotArea. Select 
ActiveChart.SeriesCollection(l). Values = _ 

'-q_contribution_entropy monthly!R" & graphStart & "C" & dataOffset + 4 & ":R" & lastRow & 
"C" & dataOffset + 4 

Windows("Affdiation]Vlacro.xls").Activate 
Sheets("T(X) - Based on Entropy").Select 

Sheets("T(X) - Based on Entropy").Copy After~Workbooks(theFilename).Sheets(copyAfter) 
Sheets("T(X) - Based on Entropy").Select 
ActiveChart.PlotArea. Select 
ActiveChart.SeriesCollection(l).Values = _ 

'-q_contribution_entropy monthly!R" & graphStart & "C" & dataOffset + 7 & ":R" & lastRow & 
"C" & dataOffset + 7 
ActiveChart.Axes(xlValue). Select 


Application.Display Alerts = True 
End Sub 


function: qLevelCopyYearsGraph 
Author: Matt Behnke 
Created: 4/26/02 

Description: Copies the graphs from the AffiliationMacro sheet 

inputs: GraphStart - the starting row of the source data.. 
numQLevels - the number of q levels 

Outputs: none 


Sub qLevelCopyYearsGraphs(ByVal numQLevels As Integer) 
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'the sheet to copy the graphs after 


Dim theFilename As String 
Dim macroFilename As String 
Dim copyAfter As String 
Dim lastRow As Integer 
Dim dataOffset As Integer 

dataOffset = numQLevels + 2 'last column of data 

theFilename = Application. ActiveWorkbook.Name 
macroFilename = "AffdiationMacro.xls" 
copyAfter = "q_summary_year" 
lastRow = CountRows(copyAfter, 1) 

Application. Display Alerts = False 

Windows(macroFilename).Activate 
Sheets("c|_level_instances_year"). Select 

Sheets("c|_level_instances_year").Copy After~Workbooks(theFilename).Sheets(copyAfter) 
Application. Display Alerts = Trae 
ActiveChart.PlotArea. Select 
'add the series for each year 

ActiveChart.SeriesCollection(l).Name = & Sheets(copyAfter).Cells(4, 1) & """" 

ActiveChart.SeriesCollection(l).Values = & copyAfter & "!R" & 4 & "C2:R" & 4 & "C" & 

dataOffset 

ActiveChart.SeriesCollection(l). Select 
With Selection 
.Smooth = True 
End With 


For i = 5 To lastRow 'offset is 3.. 5 - 3 = 2 
ActiveChart.SeriesCollection.NewSeries 

ActiveChart.SeriesCollection(i - 3).Name = & Sheets(copyAfter).Cells(i, 1) & """" 

ActiveChart.SeriesCollection(i - 3).Values = & copyAfter & "!R" & i & "C2:R" & i & "C" & 

dataOffset 

ActiveChart.SeriesCollection(i - 3).Select 
With Selection 
.Smooth = Tme 
End With 
Next i 

End Sub 'copy years chart 
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APPENDIX Q - LIST OF TECHNOLOGY TRANSFER MODELS 


Below is a screen shot from Dr. Saboe’s dissertation; 
indication that the model proposed in this research addresses that feature, l-ach of these 
models in (able I are siimmari/ed in the following sections. 


Modelln Tech Tx Literature 

M odel Feature 

Proposed 

Inform ation/C o ntrol 
Theory M odel 

Theory of Hum an Needs 

Model 

Complexity factor 
f ra m e w o rk 
fa c ts . p e rc e p t io n s . 

L e a rn in g Curve 

Actions on messages 
(tasks) 

S tru cture Changes Model- 

Internal and E x te rn a 1 

R e la 1 io n s h Ip 

Shannon Entropy of 

M essages 

J o In t entropy 

In fo r m a tio n In . 

Techno logy Model 

Goodness of 

Techno logy Alone 
causes Diffusion 

Id e n tifie s M in im u m 
num ber of nodes 
(senders and 
race ptors) 
extensions may 
address vacuum and 

Institution Building Model 

External Influences 
a ffe etthe human 
behaviorto assimilate a 
te c h n o log y 

Identifies Entropy as a 
fa c to r th a t can 
in flu e n ce th e 
acceptance of a 

1 a .■* h n r. 1 n n V 

Equilibrium vs Conflict Model 

E q u ilib riu m is an 
Instrument for Balance 
Conflict Is a Instrument 
to a p p ly P re ss u re 

Convergence in 

Entropy yields balance 

Large D iffe re n c e s in 

E n tro p y y ie Id s 

Communication Model 

A Techno logy is 

Delivered to Adopters 
Through a Channel. If 
Understood It is acted 
upon. 

O u a n tifie s t h e 

In fo rm a tio n Being 

Acted on. and 

Q u a n tif ie s n o tio n of 
‘Understood ‘in terms 
of Entropy and 

Problem solving Model 

Present Hypothesis 

Test Hypothesis with 

Data and Logic 

—ilM 


l abic II-l I cchnology I raiisIVr Models. Features, and Relation to Proposed 

Model 
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